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DELIVERY OF FUNCTIONAL PROTEIN SEQUENCES 
BY TRANSLOCATING POLYPEPTIDES 



FIELD OF INVENTION 

5 

The present invention relates to methods for translocating polynucleotides and 
polypeptides between ceils. More particularly, the present invention relates to use of 
translocating proteins to deliver a cell process-modifying molecule into the cell where 
the cell process-modifying molecule interacts specifically with a responsive target 
10 site. 

BACKGROUND OF THE INVENTION 

Translocating proteins are defined by their ability to cross biological 
membranes, such as cell manbranes. A number of translocating proteins, have been 
described, including VP22 from Herpes Simplex Virus type 1 (G. Elliot and P. 
15 O'Hare, Cell SS, 223-233 (1997)), a fragment of the Antennapedia protein from 
Drosophila (An^) (D. Derossi et al.. Journal of Biological Chemistry 262, 10444- 
10450 (1994)), and Protein H from Streptococcus pyogenes (Axcrona et al., 
Manuscript in preparation (1999)). 

Antennapedia is a homeoprotein with a DNA binding domain composed of 
20 three alpha helices with a beta-turn separating helix 2 and 3 . Experiments have 

demonstrated that a 16 amino acid pqjtide corresponding to the third heUx, named 
Antp, can translocate across membranes and accumulate in the cytoplasm and 
nucleus (Derossi et al, suprd). This peptide is intemahzed at a temperature as low as 
4°C, suggesting that endocytosis is not responsible for the internalization of the 
25 peptide. In addition, since translocation does not require classical endocytosis, Antp 
does not travel tiirough the endosomal and lysosomal compartments. Therefore, Antp 
is resistant to proteolysis and has enhanced activity in most cellular compartments (D. 
Derossi et al., J Biol Chem 221:18188-18193, 1996). 



30 



Recent ecperimraits showing that a reveree helix (i.e. the reverse primary 
sequence) and a helix composed of D-enantiomers can transverse plasma membranes 
at 4°C suggest tiiat internalization of Antp involves the formation of inverted micelles 
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in the phospholipid bilayer, making entry into cells receptor-independent and energy 
jfree (H, Hall et al.. Current Biol ^:SS0-5B7, 1996). 

The usefulness of Antp as a vector peptide has been proven successful by 
genetically fusing Antp to various peptides of interest (F. Perez et al., J Cell Sci 
5 122:717-722, 1992; F. Perez et al., Mol Endocrinol SilllB-Sl , 1994; and A. 
Prochiantz, Curr Opinion Neurob 6:629-634, 1996) or by covalent linkage via 
cysteine residues (D. Derossi et al., supra). Internalization of peptides as large as 41 
amino acids and of charged phosphopeptides (B. Allinquant et zi,^JCell Biol 
122:919-927, 1995) has been demonstrated in neuronal cells. In each case, the 
10 sequences fused to Antp retained their expected biological functions. Furthermore, 
Antp is the only translocating peptide that has been used to deliver oligonucleotides 
{up to 45 nucleotides in length) to cells in culture (CM. Troy et al,, J Neuro 16 253- 
61, 1996; G. Elliot et al., J Virol 172:6448-6455, 1998). 

Protein H is a surface antigen of the human pathogen Streptococcus 
15 pyrogenes. Protein H is taken up by B- and T-lymphocytes and translocated to the 
nucleus. In contrast to other translocating proteins, which appear to have no effect on 
cellular function, protein H has a cytostatic effect thought to be the result of its 
association with the nuclear proteins SET and hnRNP A2/B1 (D. Derossi et aL, 
supra). To date, the translocation of Protein H coupled to another molecule has not 
20 bcCT demonstrated. 

The best studied of the translocating proteins is the Herpes Simplex Virus 
protein VP22, which has the unique ability to translocate between cultured 
mammalian cells. When cells are transfected with a plasmid encoding the VP22 
protein, the expressed protein accumulates in the cytoplasm of transfected cells and, 

25 by translocating across cell membranes, spreads to the surrounding non-transfected 
cells where it accumulates in the nuclei. This process can occur at 4^C and also 
appears to be energy-free and independent of endocytosis. When protein trafficking 
though the cell is blocked using Brefeldin A, export of VP22 can still occur. Studies 
of cytoskeletal elements during VP22 trafficking suggest that the actin cytoskeleton 

30 may be involved in export or import of VP22 (Elliot and O'Hare, supra). 
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Delivery of several functional VP22 fusion proteins has been described, 
including VP22-p53 (A. Phelan et al., Nature Biotechnology ig:440-443, 1998)) and 
VP22.thymidine kinase (M.S. Dilber et aL, Gene Therapy §:A2-2l, 1999), At least 
twenty different mammalian cell types can take up a functional VP22-GFP fusion 
5 protein (Elliot and 0*Hare, supra; Aints A., et al., 7. Gene Med. 1:275-9, 1999; and 
Wybranietz W. A. et al, J. Gene Med. 1:265-274, 1999), including mouse skeletal 
myoblasts that are refractory to conventional transfection techniques (Derer W. et al., 
J, MoL Med, TV. 609-6138, 1999), 

Transfection of cells with plasmid DNA has been an invaluable tool for the 
smdy of biological systems, A variety of transfection methods {e.g. lipids, calcium 
phosphate) exist in the marketplace; however, these methods rarely result in more 
than 50% of cells expressing a gene carried on a plasmid with which the cells are 
transfected. Since most cells do take up exogenous DNA, inefficient transfections do 
not appear to be due to inability of the DNA complex to enter the cell. The majority 
of DNA is internalized by endocytosis with very little of the internalized DNA ever 
reaching the cytoplasm or nucleus where expression takes place. Indeed, observations 
of directly injected lipid-DNA complexes suggest that movement from the endosomes 
to the cytoplasm and nucleus is the most important limitation to successful 
transfections (L H. Richardson et aL, Proc. Natl Acad. 5cz.^:3137-3141, 1995). 
Consistent with this observation, peptides with membrane fusion activity, like the 
fusogenic peptide of hemagglutinin (J, Zabner et al. Journal of Biological Chemistry 
22Q; 18997-9007, 1995), or a nuclear targeting sequence (M, Wilke et aL, Gene 
Therapy 3, 1 133-1 142 (1996)) can increase transfection efficiencies in some cases. 

Thus, there is a need in the art for new and better methods for modulating 
25 expression in cells of target genes and for transfection reagents and methods of their 
use to overcome the major blocks to expression of transfected genes, i.e., degradation 
in the endosomes and the inability of DNA to enter the cell nucleus. 



10 
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BRIEF DESCRIPTION OF THE INVENTION 

The present invention overcomes these problems in the art by providing 
method(s) for modulating a cellular process in a cell in culture by contacting such a 
cell with a cell process-modifying molecule attached to a translocating polypeptide 
5 under suitable conditions, whereby the ceil process-modifying molecule is 

translocated into the cells in culture and interacts specifically therein with a target site 
responsive to the cell process-modifying molecule, thereby modulating a cellular 
process in the cell 

In another embodiment, the present invention provides method(s) for 
1 0 transfecting a cell in culture with a target gene by contacting the cell under suitable 
conditions with a polynucleotide comprising the target gene attached to a 
translocating polypeptide, whereby the cell is transfected with the target gene. 

In still another embodiment, the present invention provides method(s) for 
modulating expression of a target gene product in a cell in culture that is transfected 
1 5 with the target gene under control of one or more regulatory elements by contacting 
the cell under suitable conditions with one or more regulatory agents attached to a 
translocating polypeptide, whereby the one or more regulatory agents are translocated 
into the cell and interact therein with the one or more regulatory elements, thereby 
modulating expression of the target gene product by the celL 

20 In yet another embodiment, the present invention provides vector(s) 

comprising a polynucleotide encoding a cell process-modifying molecule attached to 
a translocating polypeptide. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a schematic drawing showing pFIN4/IacZ, which has an 
25 intervening sequence (inv) flanked by Flp recognition sites ifrt) separating the CMV 
promoter and P-galactosidase gene (lacZ), Interaction of Flp recombinase with pFIN4 
results in the removal of the inv sequaace and expression of p-galactosidase. 
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Figures 2A-D are schematic representations of the process by which a fusion 
protein composed of VP22, an anti-ATF-2 single chain antibody (sFv), and VP16 is 
delivered to the nucleus of a ceil where it binds ATF-2 and activates transcription. 
Figure 2A shows the ATF-2/LexA DNA binding domain (DBD) fusion protein binds 
5 the LexA operator (Op) upstream of the minimal TK promoter and the luciferase 
reporter gene, but does not activate transcription. Figure 2B shows that the ATF-2 
sFv-VP16 fusion protein binds ATF-2 and activates transcription. Figure 2C shows 
that the CREB sFv-VP16 fusion protein do^ not bmd ATF-2 and cannot activate 
transcription. Figure 2D shows that the fusion protein composed of VP22, the ATF-2 
10 sFv, and VP! 6 is delivered to the nucleus, where it binds ATF-2 and activates 
transcription. 

Figures 3 A-C show the attachment of a translocating protein (VP22) to an 
oligonucleotide (oligo) by generation of a bifunctional linker molecule. Figure 3A 
shows the chemical structure of a phenylboronic acid (PBA)-adapted nucleotide 
15 (PBA-dUTF). Figure 3B shows the chemical structure of a salicyUiydroxamic acid 
(SHA)-adapted amino acid (R = lysine). Figure 3C shows the reaction of the PBA- 
adapted nucleotide and the SHA-adapted amino acid to create a bifunctional linker 
molecule that attaches the oligonucleotide to VP22. 

Figure 4 is a schematic diagram illustrating a VP22-T7 RNA polymerase (T7 
20 pol) expression system. VP22-T7Pol accumulates in the nucleus upon exogenous 
addition to tissue culture cells. In the nucleus, the VP22-T7 pol fusion protein 
recognizes the T7 promoter and activates transcription of gene X, 

Figure 5 is a m^ of vector pVP22/Afyc-His, which contains the T7 promoter 
(T7), VP22 open reading frame (VP22), a multiple cloning site, a myc epitope (myc), 
25 and a polyhistidine tag (6xHis). 

Figures 6A and B show the nucleotide sequence of vector pVP22/Myc-His 
(SEQIDNO:!). 

Figure 7 is a map of pVP22/Afyc-His-TOPO(g) vector, which contains the T7 
promoter (T7), VP22 open reading frame (VP22), a multiple cloning site modified by 
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covalent coupling of the Vaccinia Virus Topoisomerase I protein (T) to linearized 
vector DNA, a myc epitope (myc), and a poiyhistidine tag (6xHis), A PCR product 
with a single 3' A base overhang can be inserted into the topoisomerase-adapted site. 

Figures 8A and B show the nucleotide sequence of pVP22/Myc-His-TOPO® 
5 vector (SEQ ID NO:2), 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are provided method{s) for 
modulating a cellular process by contacting a cell in culture under suitable conditions 
with a cell process-modifying molecule attached to a translocating polypeptide, 
10 whereby the cell process-modifying molecule is translocated into the cell and interacts 
specifically tiierein with a target site responsive to the cell process-modifying 
molecule, thereby modulating a cellular process in the cell. 

As used herein, the term "translocating protein" means a protein, polypeptide, 
or functional fragment tiiereof, that crosses biological membranes. Translocating 

15 proteins, polypeptides, functional fragments and homologues thereof, possess the 
following properties: resistance to proteolysis, receptor-independent penetration of 
cell membranes, and substantially energy-free penetration of cell membranes. 
Exemplary translocating proteins that can be used in the invention methods and 
constructs include VP22 from Herpes Simplex Virus type 1 (G, Elliot and P, OTHare, 

20 1997, supra\ a fragment of the Antennapedia protein from Drosophila (An^) (amino 
acids 43 through 58) (5'-RQIKIWFQNRRMKWKK-3') (SEQ ID N0:21) (Axcrona 
et aL, supra 1999), Protein H from Streptococcus pyogenes (D. Derossi et al., J, Biol 
Cfeew., 221:18188-93, (1996)), and the like. While each translocating protein has 
distinct properties, the general application of translocating proteins is to deliv^ other 

25 molecules to cells, either by constructing a fiision molecule (e.g., a fusion protein) or 
by attaching the desired molecule to the translocating protein (e.g, covalently or by 
means of a linker). In fusion proteins the translocating protein can be located either in 
the N-terminal or the C-terminal position. The preferred fusion protein or polypeptide 
for use in practice of the invention methods is a VP22 polypeptide. 



wo 00/58488 PCT/USOO/08571 



The term ^'VPZl polypeptide" is used herein to refer to the herpes viral VP22 
protein, as well as to functional fragments thereof, that have the translocating 
properties of the intact protein. In addition, the term **VP22 polypeptide*' as used 
herein encompasses homologues of VP22 protein, such as those derived from 
5 varicella zoster vims (VZV), equine herpesvirus (EHV), bovine herpesvirus (BHV), 
and the like, and transport-active (i.e. "functional") fragments, mutants and chimCTc 
combinations thereof. 

In particular, VP22 polypeptide encompasses polypeptides corresponding to 
amino acids 60-301 and 159-301 of the full HSVl VP22 sequence (1-301), whose 
10 sequence is disclosed in Figure 4 in WO 97/05265. Homologous proteins and 

fragments based on sequences of VP22 protein homologues from other herpes viruses 
are described in U.S. Patent 6,017,735, which is incorporated herein by reference in 
its entirety. 

The term "fusion protein" as used herein refers to two distinct proteins, 
polypeptides, peptides, and/or fragments not normally associated with each other in 
nature that are encoded by the same reading frame, resulting in the two or more 
distinct proteins and/or fragments being "fused" together. The fusion proteins used in 
invention methods are produced from nucleotide sequences encoding a translocating 
polypeptide, e.g., a VP 22 polypeptide, and another fimctional peptide in the same 
reading frame. The polynucleotide encoding the fusion protein may also contain in 
the same reading frame additional peptide or polypeptide sequences useful in the 
invention methods, such as epitope-tag encoding sequences, affinity purification-tag 
encoding sequences, additional functional protein encoding sequraces, and the Uke, or 
a combination of any two or more thereof 

25 In one embodiment, the invention provides method(s) for transfecting a cell 

with a target gene by contacting the cell under suitable conditions with a 
polynucleotide comprising the target gene attached to a translocating polypeptide, 
whereby the cell is transfected with the target gene. As used herein, the term 
"transfected" means that a gene translocated into a cell in culture due to the 



15 



20 



wo 00/58488 



PCT/USOO/08571 



8 

translocating properties of an att^hed translocating polypeptide is expressed in the 
cell, at least transiently, i,e., the cell is transiently transfected with the target gene. 

The size of polynucleotide that can be transfected into a cell according to the 
invention methods ranges from about 10 nucleotides to about 10 kilobases (kb). For 
5 example, polynucleotides in the range from about 20 nucleotides (nt) to about 5 kb, or 
from about 100 to 500 nt can be transfected into cells using the invention methods. 
Generally, the target polynucleotide is transiently transfected into a cell population in 
culture, for example, in a monolayer or tissue culture. None of the conventional 
means used to assist transfection or transduction is required, such as electroporation, 

1 0 infection employing viral vectors, calcium phosphate transfection, dextran sulfate 

transfection, lipofection, cytofection, particle bead bombardment, and the like. Instead, 
all that is required is contact (i.e., co-culture) of the cell population to be transfected with 
purified translocation protein or with synthetically prepared translocatmg protein 
having a polynucleotide of interest attached thereto by means of a covalent bond or 

1 5 linker molecule, as described herein. Any type of prokaryotic or eukaryotic cell in 
culture can be transfected using invention methods, for example, mammahan, yeast, 
insect or plant ceils. However, it is presently preferred that the cells in culture be a 
monolayer of mammalian or insect cells. 

In invention methods wherein a translocating protein is attached to plasmid 
20 DNA (i.e., via either covalent or non-covalent interactions), the DNA can be delivered 
to the nucleus for gene expression. Delivery of DNA using translocating proteins as 
described herein is an extremely valuable research tool. In up to 100% of the cells 
into which a desired poljmucleotide containing an open reading frame (e.g*, a 
polynucleotide contained in a plasmid) is delivered by an invention translocating 
25 protein, the polynucleotide is internalized, transported to the nucleus, and the open 
reading frame is then expressed, thus creating a homogeneous population of cells for 
studying such cell processes as cell cycle regulation, transcription regulation, 
translation regulation, and the like. 

In another embodiment according to the invention, method(s) are provided for 
30 modulating expression of a target gene product in a cell in culture that contains a 
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target gene under control of one or more regulatory elements. In this embodiment, the 
invention method is practiced by contacting the ceil in culture under suitable 
conditions with one or more regulatory ^ents attached to a translocating polypeptide, 
whereby the one or more regulatory agents are translocated into the cell in culture and 
5 interact therein with the one or more regulatory elements, thereby modulating 
expression of the target gene product by the cell. 

For example, a polynucleotide attached to a translocating polypeptide, such as 
VP22, can be translocated into the nucleus of the cell for expression of all or a part of 
the polynucleotide* In one embodiment, the polynucleotide comprises an open reading 
1 0 frame encoding a protein of interest, such as a target gene product or reporter gene 
product. Alternatively, the polynucleotide can be a vector (e.g., a supercoiled 
plasmid) containing a cloned open reading frame that encodes a target gene. 

It has been discovered that the translocating protein and attached cell process- 
modifying molecule can be directed to the cytoplasm for expression as well as to the 

15 cell nucleus of the population of cells in culture if the translocating protein is attached 
(e.g., fused) to a nuclear export signal (NES), Signals for the export of proteins from 
the nucleus have recently been described. Analysis of PKI (heat stable inhibitor of 
cAPK, cyclic AMP-dependent protein kinase A) (Y. Wang et aL, Gene Therapy 
432-441 (1997)) and the HIV Rev protein (W. Wen et al., Ce//S2, 463-473 (1995)) 

20 has revealed a leucine rich sequence that is sufficient to direct heterologous sequences 
out of the nucleus and into the cytoplasm. Furthemiore, fusion of the NES to a 
heterologous protein that includes the canonical S V40 larger T antigen NLS has been 
shown to result in the distribution of the protein between the cytoplasmic and nuclear 
compartments (Wang et al., supra). Similarly, the Rev protein contains sequences for 

25 both nuclear import and export and is found in both the cj^oplasmic and nuclear 

compartments of cells (Wen, et al, supra). Thus, incorporation of a NES is a potential 
method to modulate the nuclear targeting of translocating proteins, such as VP22, 
especially since the PKI NES can partially counteract the very strong signals of the 
SV40 NLS. When attached to a nuclear export signal, the translocating polypeptide 

30 and any attached polynucleotide can be stably introduced into the cytoplasm as well 
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as the nucleus of the cells in culture, thereby accomplishing partition of the 
polynucleotide between cellular compartments. In the cytoplasm, regulation of 
expression of a gene contained in the polynucleotide can be regulated using invention 
methods as described herein, 

5 Nuclear export signals suitable for use in the practice of the invention are 

known in the art and include the nuclear export signals derived from the HIV Rev 
protein or the heat stable inhibitor of cAPK, and the like. In many cases, inclusion of 
a nuclear export signal into the translocation protein-containing construct can be used 
to stably integrate a target gene of interest into the genome of the cells in culture. 

10 A cell in culture can be contacted with a translocating protein attached to a 

cell-modifying molecule according to the present invention by a variety of methods. 
In the one method, an expression cell population transfected with a polynucleotide 
encoding the translocating protein fused to the cell-modifying molecule (e.g., as a 
fusion protein) is mixed and co-cultured with a target cell population that 

1 5 spontaneously takes up the expressed translocation protein with attached cell- 
modifyiag molecule. The expressed protein accumulates in the cj^oplasm of the 
transfected expression cells and, by translocating across cell membranes, spreads to 
the surrounding non-transfected cells where it accumulates in the nuclei. For 
example, the expression cell can be a prokaryotic cell line, such as E. coli, and the 

20 target cell line can be any eukaryotic cell line, for example a mammalian cell line, 
such as CHO or COS, or an insect cell line, such as Drosophiia S2, and the like. 

Alternatively, the expression cell population can be cultured under conditions 
that promote expression of a transfected gene, a cell lysate can be prepared of the 
transfected expression cell population and the lysate can be applied to a cultured 

25 target cell population using methods known in the art and as described in the 

Examples herein. When the translocation protein is VP22, the VP22 or fusion protein 
containing VP22 will translocate to the nuclei of substantially 100% of the cell 
population. It is also possible to culture the target cells with purified translocation 
protein-containing molecules or with synthetically prepared molecules containing the 

30 translocating protein attached to a polypeptide or nucleotide by means of a covalent 
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bond or linker, as described herein. The translocating polypeptide and attached 
molecule will translocate to an entire cell population in culture within about 10 
minutes to about 72 hours, more typically within about 10 minutes to about 50 hours; 
preferably within about 10 minutes to about 24 hours. However, in some cases, no 
5 more than about 1 0 minutes is required for uptake of a translocating polypeptide and 
attached molecule by an entire cell population. 

Fusions of translocating polypeptides with known DNA binding proteins can 
also be used to dehver DNA containing an open reading frame (e.g,, a plasmid) to 
tissue culture cells. In this embodiment of the invention methods, the DNA binding 

1 0 protein acts as a linker for attaching the translocating protein to the cell-modifying 
polynucleotide (i.e. a plasmid containing a polynucleotide that acts to modify a cell 
process). Examples of protein linkers that may be fused to translocating proteins for 
the delivery of polynucleotides, such as plasmid DNAs, include histone 1 (HI) protein 
(M. Wilke et ah, supra and Niidome, et al., J. BioL Chem. Zll. 15307-15312 (1997)) 

15 and the non-histone protein HMG-17 (high mobility group 17) (S. V. Zaitsev, et al., 
Gene Ther, 4, 586-592 (1997)). HMG-17 interactions with DNA have been studied in 
depth and demonstrate that HMG-17 interacts with DNA in a non-cooperative, non- 
specific, and reversible manner (M, Bottger et al. Arch. Geschwulstforsch 265- 
270 (1990)). In each case, either the entire DNA binding protein, or a functional 

20 fragment thereof (i.e. a fragment having DNA binding activity) may be used. 

It may be preferred to complex the DNA with a reagent, such as 
polyethylenimine (PEI), that condenses and neutralizes the charged DNA prior to 
mixing with the translocating protein, or translocating protein-DN A binding domain 
fusion. 

25 Alternatively, if a shorter peptide linker is advantageous in the particular 

system used, the peptide linker may be fused to a translocating protein either as a 
chemically synthesized peptide or as a nucleotide encoding a fusion protein to be 
expressed in a prokaryote expression system. Examples of short peptide sequences 
that may be fused to a translocating protein either as a chemically synthesized peptide 

30 or as a fusion protein include polylysine sequences and sequences containing three or 
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more repeats of the peptide sequence LARL, for example, LARL-LARL-LARL (SEQ 
ID NO:3) (J. D. Fritz et al., Hum, Gene Then 7:1395-1404 (1996)). In some cases, 
from three to about 100 repeats of the LARL sequence may be used as a linking 
peptide as described herein; typically from 3 to about 50 repeats, with 3 up to about 
5 20 repeats being presently preferred. 

A preferred linker for attaching a translocating protein to a cell-modifying 
polynucleotide is the Vaccinia virus topoisomerase I protein, or a mutant form 
thereof, which allows the formation of stable topoisomerase I-DNA conjugates. 
Vaccinia DNA topoisomerase, a 314 aa virus-encoded eukaryotic type I 

1 0 topoisomerase (I), binds to duplex DNA and cleaves the phosphodiester backbone of 
one strand (S. Shuman and B, Moss, Proc, Natl Acad, Set. USA M: 7478-7482 
( 1 987)). The enzyme exhibits a high level of sequence specificity, akin to that of a 
restriction endonuclease. Cleavage occurs at a consensus pentapyrimidine element 
5'-(C/T)CCTT-3' in the scissile strand (S. Cheng et al.. Proa Natl. Acad. Set USA 

15 21: 5695-5699 (1994); J.M, Clark, Nucleic Acids Res, J^: 9677-9686 (1988) ; and 
S.G. Morham and S J. Shuman, Biol. Chem. 2^: 15984-15992 (1992)). In the 
cleavage reaction, bond energy is conserved via the formation of a covalent adduct 
between the 3' phosphate of the incised strand and a tyrosyl residue (Tyr-274) of the 
protein, Vaccinia topoisomerase can religate the covalently held strand across the 

20 same bond originally cleaved (as occurs during DNA relaxation) or it can religate to a 
heterologous acceptor DNA and thereby create a recombinant molecule. When 
attached to an invention translocating protein, the Vaccinia topoisomerase I linker will 
attach to a double stranded oligonucleotide having single 5' A base overhangs, such 
as are created in Taq mediated PCR. Such topoisomerase I-DNA conjugates may 

25 then be introduced into cells. 

Figure 7 illustrates a suitable vector wherein Vaccinia topoisomerase I linker 
is used to attach a translocating protein to a double-stranded oligonucletide of interest 
Vector pVP22/Myc-His TOPO® (SEQ ID NO:2), utilizes Vaccinia topoisomerase I 
linker to attach VP22 to a double stranded PGR product (i.e., a cell-process modifying 
30 oligonucleotide) having single 5' A base overhangs to create a VP22 fusion with 
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vector DNA. Such topoisomerase I-DNA conjugates may then be introduced directly 
into cells. 

In another embodiment, a translocating protein is used to increase the 
efficiency of plasmid delivery in conjunction with a cationic liposome. Fusion of a 
5 translocating protein to a protein domain that readily associates with a cationic 
liposome, for example a hydrophobic transmembrane domain or a 
glycosylphosphatidylinositol (GPI) anchor, facilitates interaction at the lipid-DNA 
interface. Following endocytosis of the liposome-DNA complex, the translocating 
protein will translocate the complex through the endosomal membrane and into the 
10 cell cytoplasm and, eventually, to the nucleus for gene expression. Translocating 
proteins may also be used to enhance transfection efficiencies in conjunction with 
compounds, such as chloroquine, that inhibit lysosomal hydrolases (Niidome et al., J. 
Biol CAem., 222:5307-12, 1998). 

Polynucleotides encoding fusion proteins may be constructed by standard 
15 molecular biology techniques (J. Sambrook, E. F. Fritsch and T. Maniatis (1989). 
Molecular Cloning, A Laboratory Manual Cold Spring Harbor Laboratory Press. 
Cold Spring Harbor, NY), transfected into tissue culture cells and tested for 
translocation ability by use of suitable methods, e.g., immunofluorescence, as are 
known in the art. See also the methods discribed in the Examples herein. 

20 Inducible systems are xxsed to study the phenotypic effects of protein 

expression. Since inducible systems allow expression of a protein on demand, such 
systems can be used as a research tool to study cell processes and even to enable the 
expression of toxic proteins in tissue culture. Current systems for inducible 
mammalian expression use transcriptional elements from diverse organisms, for 

25 example, Kcoli (U. Fischer et al.. Cell S2:475-483 (1995)), or Drosophila (M. Gossen 
et al., 77B5 15:471-475 (1993)) that are comtitutively expressed in a cell line along 
with a vector that contains a promoter responsive to transcriptional regulators. 
Addition of an effector molecule causes binding of the transcriptional regulators to the 
inducible promoter, thus turning on gene expression. 



wo 00/58488 



PCT/USOO/08571 



14 

The present invention provides a novel approach to this problem by providing 
method{s) for modulating expression of a target gene product in a mammalian cell 
transfected with the target gene under control of one or more regulatory elements. In 
the invention method, the target cell is contacted under suitable conditions with one or 
5 more regulatory agents attached to a translocating polypeptide, whereby the one or 
more regulatory agents are translocated into the mammalian cell and interact therein 
with the one or more regulatory elements, thereby modulating expression of the target 
gene product by the cell. The translocating polypeptide used in invention methods for 
modulating expression of a target gene product can be any of the translocating 
1 0 polypeptides disclosed herein, but is preferably a VP22 polypeptide. 

The regulatory agent can be a polynucleotide, a protein or polypeptide, or a 
small molecule. For example, the regulatory element can be a promoter operatively 
linked to a target gene wherein translocation of the regulatory agent transactivates 
expression of the target gene product by the promoter. It is preferred that the 
1 5 regulatory agent be specific for the promoter, such as a polymerase specific for the 
promoter. 

An exemplary inducible system according to the present invention utiUzes the 
RNA polymerase of bacteriophage T7, which has been used to direct gene expression 
in maimnalian cells. Expression of T7 RNA polymerase (T7 RNAP) by Vaccinia 

20 virus (A. Ramsey-Ewing and B. Moss, J. Biol Chem, 221:16962-16966, 1996; T. R. 
Fuerst et al., Proc, Natl, Acad. Sci, 8^:8122-8126, 1986) or in a stable cell line (O. 
Elroy-Stein and B. Moss, Proc, Natl, Acad, Set £7:6743-6747, 1990 and A. Lieber et 
al.. Nucleic Acids Res, 12:8485-8493, 1989), or introduction of T7 RNAP protein at 
the time of transfection (X. Chen, et al.. Cancer Gene Then 2:281-289, 1995 and X. 

25 Chen et al,. Nucleic Acids Res. 22:21 14-2120, 1994) promotes specific expression of 
genes that are located downstream of the small T7 promoter. The specificity that T7 
RNAP has for the T7 promoter ensures that the desired gene is expressed and that 
non-specific gene activation does not occur. Expression using T7 RNAP has been 
reported to be very strong, 6-fold higher than the RSV promoter in one case (A. 

30 Lieber et al.. Nucleic Acids Res. 12:8485-8493, 1989). In addition, gene expression 
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can be directed by T7 polymerase either in the nucleus (Lieber, supra and J. J. Dunn 
et al.. Gene 6S:259-266, 1988) or the cytoplasm of cells. These characteristics 
suggested that T7 RNAP can be used to specifically regulate gene expression by the 
addition of a T7 RNAPAT22 fusion protein to cells containing a T7 promoter 
5 construct. Direct delivery of T7 RNAP using VP22 technology allows specific 
control of gene expression and minimizes negative effects of delivery to non-target 
sites. Thus, the methods of the present invention allow for study of the phenotypic 
effects of protein expression on demand. Due to the specificity of the invention 
inducible system, the expression of toxic proteins can also be studied in tissue culture, 

1 0 "Toxic proteins/* as the term is used herein, refers to proteins that have immediate 
intrinsic toxic potential for living systems, including those trans-dominant mutations 
in proteins leading to constituitively active forms of the protein. Thus toxic proteins 
are distinguished from "pro-drug" type molecules that require modification after 
expression to release a toxic potential Non-limiting examples of toxic proteins that 

15 can be used in practice of the invention methods are various oncogene products, such 
as Raf and Ras oncogene products (Reviewed by Avruch et al. Trends in Biology 
19:279-83, 1994). 

Alternatively, the regulatory agent can be a transcription factor specific for the 
regulatory element so that translocation of the regulatory agent transactivates 

20 expression of the target gene product For example, the translocating protein can be 
fused to a DNA binding domain, such as that from the Ga}4 protein, and to a common 
transactivation domain, such as VP16 or B42. In this embodiment of the invention, 
the translocating protein-containing fixsion protein will localize to the nucleus and 
then specifically activate a promoter which contains upstream binding sites for the 

25 DNA binding domain incorporated into the fusion protein. 

"DNA-binding protein(s)" contemplated for use herein belong to the well-known 
class of proteins that are able to directly bind DNA and facihtate initiation or repression 
of transcription. Exemplary DNA-binding proteins contemplated for use herein include 
transcription control proteins (e.g., transcription factors and the like; see, for example, 
30 Conaway and Conaway, Transcription Mechanisms and Regulation^ Raven Press Series 
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on Molecular and Cellular Biology, Vol. 3, Raven Press, Ltd., New York, NY, 1994; T. 
Boulikas, Critical Reviews in Eukaryotic Gene Expression,M2&2)*U7-321, 1994; A. 
Klug, Ge/ie 125:83-92, 1993; W. M. Kmjewska, Int. J. Biochem., 24:1885-1898, 1992.) 

Transcription factors contemplated for use herein as a source of such DNA 
5 binding domains include, e.g., homeobox proteins, zinc finger proteins, honnone 

receptors, heiix-tum-helix proteins, helix-loop-helix proteins, basic-Zip proteins (bZip), 
P-ribfaon factoid, and the like. See, for example, Harrison, "A Structural Taxonomy of 
DNA-binding Domains," Nature^ ^52:715-719. Homeobox DNA-binding proteins 
suitable for use herein include, for example, HOX, STF-1 (Leonard et al, Mol Endo.^ 

10 7:1275-1283, 1993), Mat a-2, INV, and the like. See, also, Scott et al Biochem. 
Biophys. Acta, 2S2:25-48, 1989. It has been found that a fragment of 76 amino acids 
(corresponding to amino acids 140-215 described in Leonard et aL, 1993) containing the 
STF-1 homeodomain binds DNA as tightly as wild-type STF-1 . Suitable zinc finger 
DNA-binding proteins for use herein include Zif268, GLI, XFin, and the like. See also, 

15 Klug andRhodes, Trends Biochem. Sti., 12:464, 1987; Jacobs and Michaels, Afevv^/^i, 
2:583, 1990; and Jacobs, ^MfiO/., 11:4507-4517, 1992. 

The DNA-binding domain(s) used in the invention methods can also be obtained 
from a member of the steroid/thyroid honnone nuclear receptor superfamily, or be 
substantially the same as those obtained from a member of the superfamily. The 

20 DNA-binding domains of substantially all members of the steroid/thyroid hormone 
nuclear receptor superfamily are related. Such domains consist of 66-68 amino acid 
residues, and possess about 20 mvariant amino acid residues, including nine cysteines. 
Members of the superfamily are characterized as protems which contain these 20 
invariant amino acid residues. The highly conserved amino acids of the DNA-binding 

25 domain of members of the superfamily are as follows: 
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Cys-X-X-Cys.X-X-Asp*-X-Ala*.X-Gly*- 
X-Tyr*-X-X-X-X-Cys-X-X-Cys-Lys*.X- 
Phe-Phe-X-Arg*-X-X-X-X-X-(X-X-) Cys- 
X-X-X-X.X-(X-X-X-)Cys.X-X.X-Lys.X- 
5 X-Arg.X-X.Cys-X-X-Cys-Arg*-X-X- 
Lys*.Cys-X-X-X-Gly*-Met(SEQIDNO:4); 

wherein X designates non-conserved amino acids within the DNA-binding domain; an 
asterisk denotes the amino acid residues which are ahnost universally conserved, but for 
which variations have been found in some identified hormone receptors; and the residues 
10 enclosed in parenth^is are optional residues (thus, the DNA-binding domain is a 

minimum of 66 amino acids in length, but can contain several additional residues). Such 
DNA binding domains bind to 2-half site recognition sites, as is well knovm in the art to 
transactivate transcription under control of a response element comprising the 
recognition site. 

1 5 The GAL4 DNA binding domain does not interact with a 2-half site DNA 

recognition site. The DNA binding domain of the yeast GAL4 protein comprises at least 
the first 74 amino temiinal amino acids thereof (see, for example, Keegan et al^ Science 
2M:699-704, 1986). Preferably, the first 90 or more amino terminal amino acids of the 
GAL4 protein will be used, for example, the 1 47 amino terminal amino acid residues of 

20 yeast GAL4. 

Another DNA binding domain that can be used in the practice of the present 
invention is the Tet operon. The tetracycline inducible system is well-known in the art 
(see, e.g, Gossen et al.. Proa. Natl Acad, Set 52:5547-5551 (1992); Gossen et al., TIBS 
15:471-475 (1993); Furth et al., Proc Natl Acad. Scl 21:9302-9306, (1994) ; and 
25 Shockett et al., Proc Natl Acad. Scl 22:6522-6526 (1995)). 

Transcription modulating domains are of two types, those that activate 
transcription of a gene sequoice operatively associated with a response element that is 
responsive to the invention system (i.e., transcription activation domains) and those 
that repress or de-activate transcription of a gene sequence operatively associated with 



wo 00/58488 



PCT/USOO/08571 



18 

a response element that is responsive to the invention system (i.e., transcription 
repression domains). The ability of the invention system to activate transcription of 
such a target gene is generally enhanced when the transcription modulating domain 
attached to the translocating protein is a transcription activation domain. Transcription 
5 activation domains contemplated for use in the practice of the present invention can be 
obtained from a variety of sources and are well known in the art. 

Such transcription activation domains are typically derived from transcription 
factors and comprise a contiguous sequence that functions to activate gene expression 
when associated with a suitable DNA-binding domain. For example, suitable activation 

1 0 domains can be obtained from the N-terminai region of members of the steroid/thyroid 
hormone nuclear recq)tor superfamily, from transcription factor activation domains, 
such as, for example, VP16, GAL4, NF-kB or BP64 activation domains, and the like 
(See, for example, M. Manteufifel-Cymborowska, Acta Biochim Pol 46fn :77-89 (1999); 
T. Tagami et al., Biochem BiophysRes, Commun. 252£2):358-63 (1998), W. W^tin, 

1 5 Adv Pharmacol^ 47:89- 112 (2000)). The activation domain presently preferred for use 
in the practice of the present invention is obtained from the C-tertninal region of the 
VP16 protem. 

Transcription repressor domains that can be used in the invention methods 
include those that repress transactivation of gene expression. Exemplary transcription 
20 repressor domains suitable for use as the transcription modulating domain in the 

invention methods include RAFT, CREM, MECP-2, SMRT, NcoR, mSmSA, RAR, TR, 
SMRTR, and the like. 

Another way in which translocating proteins may be used in inducible 
expression systems for modulating expression of a target gene is to create gene 

25 fiisions or in vitro covalent linkage with site-specific recombination sequences, which 
are sequences of nucleic acids that are specifically recognized by a particular site- 
specific recombinase. Site specific recombinases, as the term is used herein, are 
enzymes that catalyze the excision and/or recombination of nucleic acid sequences, 
and may form intermediate complexes with the transfer sequence DNA during the 

30 recombination event. These enzymes recognize a relatively short, unique nucleic acid 
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sequence that serves as a site for both recognition and recombination. Recombinases 
particularly useful in the practice of the invention are those that function in a wide 
variety of cell types because such enzymes do not require any host specific factors 
and do not require ATP to fimction. 

5 Two major families of site-specific recombinases firom bacteria and unicellular 

yeast have been described: the integrase family and the resolvase/invertase family. In 
these recombinases, strand exchange catalyzed by site specific recombinases occurs in 
two steps of (1) cleavage and (2) rejoining, involving a covalent protein-DNA 
intermediate formed between the r^^ombinase enzyme and the DNA strand(s). The 

10 nature of the catalytic amino acid residue of the oizyme and the Une of entry of the 
nucieophile is different for these two recombinase families. For cleavage catalyzed 
by the invertase/resolvase family, the nucieophile hydroxyl is derived fi-om a serine 
and the leaving group is the 3'-0H of the deoxyribose. For the integrase family, the 
catalytic residue is a tyrosine and the leaving group is the 5'-0H. In both 

15 recombinase families, the rejoining step is the reverse of the cleavage step. 

The recombinase activity of Cre has been studied as a model system for the 
integrases. Cre is a 38 kD protein isolated fi-om bacteriophage PL It catalyzes 
recombination at a 34 base pair stretch of nucleic acids called loxP. The loxP site has 
the sequence 5^ATAACTTCGTAT AGCATACAT TATACGAAGTTAT-3^ (SEQ ID 

20 NO: 5; spacer region underlined), consisting of two 13 base pair palindromic repeats 
flanking an eight basepair core sequence (Hoess et al, Proc. Natl Acad. Sci USA 
72:3398, 1982 and U. S. Patent No. 4,959,217, the disclosure of which is incorporated 
herein by reference in its entirety). The repeat sequences act as Cre binding sites with 
the crossover point occurring in the internal spacer core. Each repeat appears to bind 

25 one protein molecule wherein the DNA substrate (one strand) is cleaved and a 

protein-DNA intermediate is formed having a 3'-phosphotyrosine linkage between 
Cre and the cleaved DNA strand. Crystallography and other studies suggest that four 
proteins and two loxP sites (each on a different DNA molecule) form a synapsed 
structure in which the DNA resembles models of four-way HoIIiday-jimction 

30 intennediates, followed by the exchange of a second set of strands to resolve the 
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intermediate into recombinant products (see, Guo, et aL Nature 2S2:40-46, 1997). 
The asymmetry of the core region of the loxP recombination sequence is responsible 
for directionality of the recombination reaction. When two loxP sites on the same 
DNA molecule are in a directly repeated orientation, Cre excises the DNA between 
5 these two sites, leaving a single loxP site on the DNA molecule (Abremski et al, Cell 
22:1301, 1983). Thus, the repeat sequences act as Cre-specific binding sites with the 
recombination crossover point occurring in the core. 

The loxP site is so complex in size that it occurs only in the PI phage genome. 
Therefore, use of the loxP sites in the invention methods assures that the enzyme will 

10 not cut the transfer sequence within the interior of the sequence imless the transfer 
sequence is from the PI phage genome. The activity of Cre in a wide variety of 
cellular backgrounds, including yeast, shows that Cre does not require host specific 
factors for activity (Sauer M?/. Cell BioL 2:2087-2096, 1987) in plant (Albert et aL, 
Plant X 2:649-659, 1995; Dale and Ow, Gene 21:79-85, 1990; Odell et aL MoL Gem 

15 Genet. 222:369-378, 1990), or mammalian cells, including both rodent and human 
cells (van Deursen et al. Proa Natl Acad, ScL USA 22:7376-7380, 1995; Agah et al, 
J. Clin, Invest. 1110:169-179, 1997; Sauer and Henderson, New Biologist 2:441-449, 
1990). 

The Cre protein also recognizes a number of variant or mutant lox sites 
20 (variant relative to the loxP sequence), including the loxB, loxL and loxR sites, which 
are found in the E. coli chromosome. Other variant lox sites include loxP5 1 1 
(5^-ATAACTTCGTAT AGTAT AC ATTA TACGAAGTTAT-3 ' (SEQ ID NO:6; 
spacer region underlined); IoxC2 

r5^-ACAACTTCGTAT AATGTATGC TATACGAAGTTAT>3' (SEQ ID N0:7; 

25 spacer region underlined; U.S. Patent No. 4,959,217). Additional variants of the loxP 
site can be prepared by those of skill in the art and will generally have no more than a 
total of one to three point mutations in the two repeats that comprise the site-specific 
recombination sequence. Cre catalyzes the cleavage of the lox site within the spacer 
region and creates a six base-pair staggered cut. The two 13 bp inverted repeat 

30 domains of the lox site represent binding sites for the Cre protein. The two lox bHqs 
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may differ so long as Cre is able to recognize both lox sites. However, if two lox sites 
differ in their spacer regions in such a manner that the overhanging ends of the 
cleaved DNA cannot reanneal with one another, Cre cannot efficiently catalyze a 
recombination evoit using the two different lox sites. The efficiency of the 
5 recombination event will depend on the degree and the location of the variations in 
the binding sites. For example, the loxC2 site can be efficiently recombined with the 
loxP site because the two lox sites differ by a single nucleotide in the left-binding site. 
Thus, when Cre is the site specific recombinase used in the practice of the invention 
methods, the site-specific recombination sequence is a loxP site, or a variant thereof 
1 0 recognized by the Cre enzyme. 

Axecombinase of the integrase family with similar function is Flp, a 
recombinase identified in strains of Saccharomyces cerevisiae that contain 2ji-circle 
DNA. Flp recognizes a DNA sequence consisting of two 13 basepair inverted repeats 
flanking an 8 basepair core sequence 

1 5 (5 '-GAAGTTCCTATT CTCTAGAAA GTATAGGAACTTC-3 ' (SEQ ID NO: 8); 
spacer imderlined) called FRT (Flp Recombination Target site). A third repeat 
follows at the 3' end in the natural sequence, but does not appear to be required for 
recombinase activity. The Flp gene has been cloned and expressed in E coli and in 
mammalian cells (PCT International Patent Application PCT/US92/01899, 

20 Publication No: WO 92/15694, the disclosure of which is herein incorporated by 
reference) and has been purified (Meyer-Lean et ai. Nucleic Acids Res, 15:6469, 
1987; Babineau et al, J. Biol Chem. 26Q:12313, 1985; Gronostajski and Sadowski, J. 
Biol Chem. 2|^: 12328, 1985). 

Like Cre, Flp is functional in a wide variety of systems including bacteria 
25 (Huang et al, J. Bacteriology 122:6076-6083, 1997), insects (Goiic and Lindquist, 
Ce// 59:499-509, 1989; Golic and Golic, Genetics 144:1693-171 1, 1996), plants 
(Lyznik et al. Nucleic Acids Res 21:969-975, 1993) and mammals (U. S. Patent Nos. 
5,677,177 and 5,654,182), which shows the Flp does not require host specific factors 
for operability. 
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Additional integrases that can be used in practice of invention methods are 
retroviral integrases, including HIV and ASV integrases (Reviewed in Annu. Rev. 
Microbiol. 53:245-81, 1990), 

In practice of invention methods for modulating expression of a target gene 
5 produce, a site-specific DNA recombinase or integrase fused to a translocating protein 
may be introduced as described herein into cells that have been transfected with a 
plasmid containing a transcription-blocking sequence (e,g., a transcription termination 
sequence) flanked by recombinase recombination sites specific for the recombinase or 
integrase used and placed between a promoter and an open reading fi^ame encoding a 
10 target gene. For example, if the recombinase is Fip, the recombinase sites are frt sites, 
and if the recombinase is Cre, the recombinase sites are lox sites. Exposure of the 
transfected cell to the recombinase or integrase-adapted translocating protein results 
in removal of the transcriptional terminator by the activity of the recombinase and 
expression of the gene of interest as illustrated in Figure 1 . 

1 5 Thus, in the invention methods for modulating a cellular process, the one or 

more regulatory elements can include a transcription-blocking sequence flanked by 
recombinase recombination sites and the regulatory agent can be a recombinase 
specific for the recombination sites, wherein translocation of the recombinase causes 
recombination of the recombination sites, thereby modulating expression of the target 

20 gene product. 

Alternatively, rather than placing a pair of recombinase sites flanking a 
polynucleotide segment to be excised, a single recombinase site can be incorporated 
into (or exist naturally in) the genome of the target cell that also contains a plasmid 
containing a target gene and a second recombinase site that pairs with the genomic 
25 recombinase site. When such a cell is contacted by a recombinase (e,g. integrase) 
specific for the recombinase site(s) in the target cell, translocation of the recombinase 
will trigger a recombination event such that the target gene will become stably 
incorporated into the genome of the target cell at the genomic recombinase site. 
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The recombinase or integrase may be introduced by mixing of two cell 
populations, one expressing the translocating protein-enzyme (e*g., Flp or Cre) fusion 
and the other containing the heterologous gene. Alternatively, the translocating 
protein-enzyme fusion may be produced in a prokaiyotic or eukaiyotic expression 
5 systOTi, purified using known methods and as described h^ein, and applied to cells 
containing the heterologous gene. The cells may be either transiently transfected with 
the heterologous gene or carry it stably integrated in their genomes. 

Alternatively, the regulatory agent used in invention methods for modulating 
gene expression can be the HIV Rev protein paired with the Rev regulatory element 
10 as regulatory element (RRE). As illustrated in Example 5 herein, increasing amounts 
of Rev protein delivered to target cells containing RRE result in increased expression 
of an operatively linked target gene. 

In another embodiment of the invention method for modifying a cellular 
process, the protein molecule fiised to a translocating protein is a Fv antibody 

1 5 fragment or a single chain antibody (sFv). Preferably polynucleotide encoding a 

fusion protein containing the translocating protein and sFv is introduced into cells in 
culture, as described herein^ for translocation to the cell nucleus and intracellular 
expression. If the sFv is specific for an antigen target associated with intracellular 
machinery involved in a cellular fimction, for example, a target located within the cell 

20 nucleus, binding of the sFv to the intracellular target can interfere with cellular 

functions, such as Ras signaling (O. Elroy-Stein, T. R. Fuerst, B. Moss, Proc, Natl, 
Acad, Set 86, 6126-6130 (1989), membrane transport (O. Cachet, et aL, Cancer 
Research 58, 1 170-1 176 (1998)), or viral replication (J. H. Richardson, J. G. 
Sodroski, T. A. Waldmann, W. A. Marasco, Proc, Natl. Acad. Set 92, 3137-3141 

25 (1995)). Additional exemplary intracellular targets for which single chain antibodies 
can be constructed and used in the invention methods to modify cellular processes 
include human kinases, transcription factors, proteins controlling apoptosis, cell cycle 
regulators, oncoproteins, and the like. 

Therefore, the one or more regulatory agents used in the invention method(s) 
30 for regulating cell processes can include a Fv or sFv specific for a component of the 
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one or more regulatory elemrats within the cells in culture (whether the regulatory 
element is native or transfected into the cells), wherein translocation of the Fv or sFv 
into the cell by the translocating protein and binding of the antibody to the component 
modulates expression of the target gene product, 

5 For example, intracellular processes have been modified by creating a fusion 

protein containing an anti-ATF-2 sFv fused to VP22 and the transcriptional activation 
domain of VP 16 (Figures 2A-D). ATF-2 belongs to the bZIP family of transcription 
factors and controls gene expression via 8-bp ATF/CREB motifs, either as a 
homodimer or as a heterodimer — for instance, with Jun (S. Huguier et al.. Molecular 

10 and Cellular Biology 1S:7020"7029, 1998). If the fusion protein is expressed within a 
reporter cell line that has ATF-2 bound upstream of a reporter gene, e.g., the bacterial 
luciferase gene, binding of the sFv in the fusion protein to the ATF-2 antigen in the 
cell nucleus (Figures 2A-D) triggers expression of the ATF-2 sFv-VP16 fusion 
(Figure 2B), but not a CREB sFv-VP 16 fusion (Figure 2C), resulting in expression of 

1 5 the reporter gene. This experiment d^onstrates that the ATF-2 sFv is delivered to 
the cell nucleus where it binds the ATF-2 antigen. 

*Tv" as used herein means a genetically engineered fragment containing the 
variable region of the light chain and the variable region of the heavy chain expressed 
as two chains but chemically linked; **sFv" as used herein means a genetically 
20 engineered molecule containing the variable region of the light chain and the variable 
region of the heavy chain, linked by a suitable polypeptide linker as a gmetically 
fused single chain molecule. 

The linkage of light chain and heavy chain variable regions in a Fv may be 
noncovalent, as described in Inbar et aL, Proc, Natl Acad. ScL USA ^:2659-62, 
25 1972. Alternatively, the variable chains can be linked by an intermolecular disulfide 
bond or cross-linked by chemicals such as glutaraldehyde. 

Exemplary linkers used to attach two segments of a Fv or to attach any other 
two proteins to (e.g., a translocating protein and a DNA binding protein) can be a 
bifunctional cleavable cross-linker, such as N-succinimidyl (4-iodoacetyl)- 
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aminobenzoate; sulfosuccinimydil (4-iodoacetyl)-aminoben2X)ate; 4-succiniinidyl- 
oxycarbonyI-a-(2-pyridyldithio)toluene; sulfosucciiiiinidyl-6- [a-methyl-a- 
(pyridyldithiol)-toluamido] hexanoate; N-succinimidyI-3-(-2-pyridyiditiiio)- 
proprionate; succinimidyl 6[3(-(-2-pyridyIdithio)-proprionamido] hexanoate; 
5 sulfosuccixiimidyi 6[3(-(-2-pyridyldithio)-propionamidoj hexanoate; 3-(2- 

pyridyldithio)-propionyI hydrazide, Elhnan's reagent, dichlorotriazinic acid, S-(2- 
thiopyridyl)-L-cysteine, and the like. Further bifimctional linking conapounds are 
disclosed in U.S. Patent Nos. 5,349,066. 5,618,528, 4,569,789, 4,952,394, and 
5,137,877, each of which is incorporated herein by reference in its entirety. 

10 These Unkers can be attached to purified proteins using numerous protocols 

known in the art, such as those described in Examples 1 and 2 (see Pierce Chemicals 
"Solutions, Cross-linking of Proteins: Basic Concepts and Strategies,'* Seminar #12, 
Rockford, IL). 

Preferably the antibodies used in the invention methods are sFv, comprising 
15 Vh and Vl chains connected by a peptide linker. These single-chain antigen binding 
proteins (sFv) are prepared by constructing a structural gene comprising DNA 
sequences encoding the Vh and Vl domains connected by an oligonucleotide. The 
structural gene is inserted into an expression vector, which is subsequently introduced 
into a host ceil such as E, colL The recombinant host cells synthesize a single 
20 polypeptide chain with a linker peptide bridging the two V domains. Methods for 
producing sFvs are described, for example, by Whitlow and Filpula, Methods^ 2: 97- 
105, 1991; Bird et aU Science 2^:423-426, 1988; Pack et aL, Bio/Technology 
11:1271-77, 1993; and Ladner et aL, U.S. patent No, 4,946,778, which is hereby 
incorporated by reference in its entirety. Such well known procedures can be 
25 modified to create fusion proteins comprising a sFv and a translocating protein, as 
described herein. 

For example, the linker in the sFv can be a peptide having from about 2 to 
about 60 amino acid residues, typically fi"om about 5 to about 40, preferably from 
about 10 to about 30 amino acid residues. This alternative is particularly 
30 advantageous when the ligand moiety is proteinaceous. For example, the linker 
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moiety can be a flexible spacer amino acid sequence, such as those known in single- 
chain antibody research. Examples of such known linker moieties include CjGGGS 
(SEQ ID NO:9), (GGGGS)„ (SEQ. ID NO:10), GKSSGSGSESKS (SEQ ID NOrll), 
GSTSGSGKSSEGKG (SEQ. ID NO: 12), GSTSGSGKSSEGSGSTKG (SEQ ED 
5 NO: 13), GSTSGSGKSSEGKG (SEQ ID NO:14), GSTSGSGKPGSGEGSTKG (SEQ 
ID NO:15X EGKSSGSGSESKEF (SEQ ID NO:16), SRSSG (SEQ. ID NO:17), 
SGSSC (SEQ ID NO: 18), and the like, A Diphtheria toxin trypsin sensitive linker 
having the sequence MGRSGGGCAGNRVGSSLSCGGLNLQAM (SEQ ID NO: 19) 
is also useftil. Altematively, the peptide linker moiety can be VM or AM, or have the 

1 0 structure described by the formula: AM(G2 to 4S)xAM wherein X is an integer from 1 
to 1 1 (SEQ ID NO:20). Additional linking moieties are described, for example, in 
Huston et al, PNAS M.:5879'5883, 1988; Whitlow, M., et aL, Protein Engineering 
6:989-995, l993;lSlcmonetal., Biochemistry 21*^5-553, 1996; A. J. CumhoretaL, 
Bioconj, Chem. 3:397-401, 1992; Ladumer ^z/., J, MoL Biol 222:330-337, 1997; 

15 and U.S. Patent. No. 4,894,443, the latter of which is incorporated herein by reference 
in its entirety. 

It is contemplated to be within the scope of the present invention that the 
target gene within a cell in culture can be a reporter gene, such as is known in the art, 
for example, a non-endogenous gene encoding a detectable marker, such as the E, coli 
20 B-galactosidase gene, luciferase, or CAT. 

As an aid in purifying fee fiision molecules or detecting expression triggered by 
use of the invention methods, it is often convenient to include in the polynucleotide that 
encodes the reporter gene an additional nucleotide sequence that encodes a protein 
tag, such as an antibody epitope (e.g., derived from Myc), a fluorescent peptide, or a 
25 poly His tag. 

A variety of methods can be used for attaching a peptide or oligonucleotide 
molecule to a translocating protein. For example, the translocating protein can be 
covaiently conjugated to a translocating protein or to a polynucleotide encoding a 
translocating polypeptide for use in the invention methods using two low molecular 
30 weight chemical affinity ligands that can be attached to macromolecules like DNA 
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and to proteins and which combine to form a linker useful in preparation of fusion 
proteins or fusion genes used in the invention methods* Two such low molecular 
weight chemical affinity ligands are salicylhydroxamic acid (SHA) and phenylboronic 
acid (PB A), which quickly react to form a reversible pH-sensitive covaient bond 
5 (Figures 3 A-C), thus providing a convenient linker to attach a translocating protein to 
another protein or to a polynucleotide. For example^ nucleic acid molecules 
containing PB A can be synthesized using PBA-NTPs (available from ProLinx, 
Seattle, Washington) or, if double stranded, can be labeled with PBA-ATP using the 
enzyme terminal transferase. SHA-NHS ester can also be used to attach SHA to 

10 lysine residues present in a translocating protein, hi this embodiment, a PBA-adapted 
molecule and a SHA-adapted translocating protein are covalently linked and applied 
to cells. Alternatively, other linkers, such as disulfide bonds (which would be 
disrupted upon delivery to the cytoplasm) or bifiinctional linker molecules (e.g., as 
disclosed herein) may also be used. Covaient linking using these or other linkers 

1 5 known in the art and disclosed herein provides a relatively stable attachment of the 
translocating protein to another molecule. 

In addition, a number of strong non-covalent molecular interactions can be 
used to generate translocating protein-containing complexes. For example, 
Strepavidin binds biotin very strongly (the disassociation constant is ^proximately 

20 10"^^). This strong affinity, which is routinely used to attach proteins to substrates, 
can be used to form a Unker that attaches a cell process-modifying molecule to a 
translocating polypeptide. For example, a fusion protein containing VP22 
translocating protein and strepavidin may be generated and complexed with a 
biotinylated oUgonucleotide to form a linker attaching a cell process-modifying 

25 polynucleotide to a translocating polypeptide. Strepavidin binds biotin as a tetramer 
(tetramer MW = 60,000 daltons) and VP22 is beheved to act as a multimer, making 
this combination a suitable one. 



30 



Another polypeptide molecule that may be used as a linker to attach a cell 
process-modifying molecule to a translocating polypeptide for use in invention 
methods is the single stranded DNA binding protein (SSB) fi-om E colu Only 21 
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amino-acid residues (amino acid residues 2 through 22) of SSB appear to be involved 
in binding to ssDNA (i.e., "the functional fragment of SSB"). Furthermore, binding 
of SSB to ssDNA is not sequence specific. Therefore, a fusion protein containing a 
translocation protein fused to a functional fragment of SSB is an extremely attractive 
5 linker for attaching a translocating protein to an oligonucleotide or to plasmid DNA. 
Unlike the linking molecules described above (i.e., those containing PBA and SHA or 
strepavidin and biotin), which require modification of the oligonucleotide to be linked 
to the translocating protein, fusion proteins containing a SSB and a translocating 
protein can be attached to unmodified DNAs, thus providing time and cost-saving 
10 advantages. 

The invoition will now be described in greater detail by reference to the 
following non-limiting examples, 

EXAMPLE 1 

Introduction of VP22 fusion protein into cells in culture by transfection 

1 5 The complete open reading fi-ame (ORF) encoding the VP22 protein was 

cloned into the eukaryotic expression vector pcDNA3.1/myc-His (Invitrogen, San 
Diego, CA), to create the vector pVP22/Afyc-His (Figure 5; SEQ ID NO:l), in which 
the ORF of the fusion partoer can be inserted into a multiple cloning site located 
between the VP22 ORF and sequences encoding the C-terminal Anti-myc epitope and 

20 a poly His tag. The anti-myc epitope provides for easy detection of recombinant 
protein with Anti-myc antibody, and the poly His tag is useful for purification. 
Alternatively, the vector used was modified by covaient coupling of the Vaccinia 
Virus Topoisomerase I protein to linearized vector DNA (e.g., pVP22 TOPO® TA 
Cloning® Kit (Invitrogen)). In this type of vector, the ORF of a gene product of 

25 interest (i.e., a "fusion partner") was cloned as a PGR product into the vector. An 
example of such a Topoisomerase-adapted vector encoding the VP22 polypeptide is 
pVP22/Myc-His TOPO® vector (Figure 7; SEQ ID NO:2). In either case, the 
plasmid containing the VP22 gene fusion was then transfected into cells in culture 
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In a typical transfection, COS or CHO cells were seeded into 6 well plates and 
grown to approximately 50% confluence prior to transfection. For each well, 5 
DNA was diluted into 1.5 mi OptiMEM medium (Gibco BRL, Chagrin Falls, OH) 
and mixed with 15 jil Pfe-6 lipid (Invitmgen) for COS cells or 15 |ul Pfe-7 lipid 
5 (Invitrogen) for CHO cells. Diluted DNA plus lipid was incubated with cells for 4 hr 
at 37 °C, then replaced with the appropriate medium and incubated for an additional 
40-48 hr at 37X, 

Spreading of the VP22 fusion protein from the transfected cells to the 
surrounding untransfected cells was detected by immunofluorescence using an 

10 antibody against the myc epitope tag. In a typical innmunofluorescence experiment, 
transfected cells in a single 35 mm well of a six well tissue culture plate were washed 
with phosphate buffered saline (PBS) and fixed by incubation in 2 ml of methanol for 
5 min. Cells were w^hed five times with PBS (2 ml/wash), blocked for 1 5 min 
using PBS containing 10% fetal bovine serum (FBS), and then mcubated for 20 min 

15 with an antibody against the myc epitope tag (Invitrogen) diluted at 1 :500 in 1 ml of 
PBS contaming 10% FBS, For attachment of a fluorescent molecule to the antibody, 
cells were washed twice with PBS and incubated with a goat anti-mouse Oregon 
Green conjugate (Molecular Probes, Eugene, OR; cat # 0-6383) diluted 1:500 in 1 ml 
of PBS containing 1 0% FBS for 20 minutes. After two additional washes, the 

20 antigen:antibody complexes were observed using an Olympus IX-70 fluorescence 
microscope equipped with a fluorescein isothiocyanate (FITC) filter. 

Translocation of several VP22 fiision proteins prepared in this way, including 
those incorporating Aequorea victoria green fluorescent protein (hQGFP), lacZ, or the 
site specific recombinase Flp as the fiision partner, has been achieved by this method. 

25 EXAMPLE 2 

Transfection of cells with a gene fusion followed by mixing with untransfected 
cells 

To demonstrate how VP22 may be used to modulate expression of a fimctional 
gene product, a system for delivery of the site specific DNA recombinase Flp, was 
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developed, COS cells expressing a VP22-Flp recombinase fusion protein were 
prepared as described above and mixed with CHO cells that had been transfected with 
a reporter plasmid pFIN4//acZ (Figure 1). In the reporter plasmid, a segment of DNA 
that includes a transcriptional terminator, the Bovine Growth Hormone 
polyadra^ylation signal (Goodwin andRottman, J. Biol Chem. 262:16330-4, 1992), is 
flanked by frt sites (recombination sites recognized by the recombinase Flp) to 
separate the CMV promoter and an otherwise operatively associated reporter gene 
encoding P-galactosidase as reporter. Cells transfected with pFIN4//^cZ did not 
express P-galactosidase due to the presence of the transcriptional terminator placed 
between the^r^ sites. 

To illustrate that expression of the reporter gene could be controlled by 
translocating of the VP22-Flp recombinase fusion protein from one cell population to 
another, two populations of CHO cells were prepared, one transfected with plasmid 
that expresses the VP22-Flp recombinase fusion protein, and another transfected with 
1 5 plasmid that expresses a VP22-GFP fusion protein. Transfections were carried out as 
described above. Twenty-four hours after the end of the transfection, cells were 
recovered by trypsinization. Then the two cell lines were mixed and incubated for an 
additional 24 hr before staining for p-galactosidase activity. 

CHO cells transfected with pFIN4//acZ only expressed P-galactosidase when 
20 mixed with COS cells that express the VP22-Flp fusion. In the presence of Flp 
recombinase, the segment of DNA containing the transcriptional terminator was 
removed by recombination of the frt sites, and p-galactosidase was expressed. 
Incubation of the population of CHO cells transfected with plasmid that expresses a 
VP22-GFP fusion protein, but does not express a VP22-Fip fusion protein did not 
25 result in expression of P-galactosidase. 

This experiment shows that the VP22-Flp fusion protein translocates between 
different manamalian cell types and that functional Flp recombinase can be delivered 
to cells as the fusion partner in a VP22 fusion protein. 



5 
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EXAMPLE 3 

Transfection of cells with a gene fusion followed by preparation of a cell free 
lysate from the transfected cells 

In these studies, a cell fee lysate was prepared from cells transfected with 
5 pVP22/myc-His as follows: COS cells were grown to 50% confluence in a 100 mm 

dish (approximately 10^ ceils). Cells were transfected with 20 jag of pVP22/myc-His 
DNA using Pfe-6. Forty hours post-traasfection, the cell monolayer was washed 
twice with PBS and then collected by scraping into 10 ml PBS. Cells were 
centrifiiged at 500 g for 5 min and the PBS was aspirated from the cell pellet, which 
1 0 was then frozen on dry ice. Frozen cell pellets were stored at -80^C prior to 

preparation of lysates. The cell pellet was thawed on ice following addition of 0,5 mi 
ice cold lysis buffer (10 mM HEPES, pH 7.9, 400 mM NaCl, 0.1 mM ethylene 
diaminetetraacetic acid (EDTA), 0,5 mM dithiothreitol (DTT), 5% glycerol). The 
lysate was then vortexed briefly and centrifuged at 10000 X g for 5 min at 4^C. 

15 The entire supernatant was immediately added to 2 x 10^ cells in a 35 mm 

plate without removing the tissue culture media. After a 10 minute incubation at 
37°C, the media was removed and VP22/myc-His protein located in the nuclei of the 
cells was detected by immunofluorescence as described above. 

An alternative method for the detection of VP22 fusion protein uptake in 
20 mammalian cells from a cell free lysate prepared from cells that express the fiision 
protein utilizes Western blot. In a typical Western blot experiment, HeLa, COS or 
CHO cells were plated at 50% confluence in 60 mm dishes. Following application of 
the lysate, the cells were washed once with PBS and then with PBS containing 500 
mM NaCl to remove protein non-specifically bound to the outside of the cell. The 
25 cells were treated with trypsin for about 5 minutes to disassociate them from the plate 
and to digest any remaining extracellular peptide. The cells were solubilized and the 
proteins separated on a 4-20% Glycine gel (Invitrogen, Carlsbad, CA). The separated 
proteins were then transferred to nitrocellulose and probed with the appropriate 
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antibody conjugated to horseradish peroxidase (HRP). The VP22 fusion proteins 
were then detected using chemiluminescence. 

Thus, the VP22/myc-His protein contained in the lysates of cells transfected 
with pVP22//w7c-His translocated to the nuclei of all untransfected cells within 10 
5 minutes of contact. This finding shows that lysates containing VP22 are useful for the 
delivery of protein sequences into cell types without the need for transfection of the 
receptor cell population. 

EXAMPLE 4 

Expression of a VP22 fusion protein in E.coIi. followed by application of purified 
10 protein to ceils in culture 

The vector pCRT7A^22-l was developed to allow expression and 
purification of VP22 fusion proteins firom E. coli. This vector utilizes a C-terminal 
firagment of the VP22 protein (amino acids 159-301), which has proven sufficient for 
translocation of VP22 fusion proteins across cell membranes. Using the above 

1 5 described methods, VP22 fusion proteins were prepared containing various proteins 
as the fusion partner (including the HIV Rev protein and human protein rhoA), and 
the fusion proteins were expressed and purified . Activity of each fusion partner was 
demonstrated following uptake by cells in culture. To demonstrate the high efficiency 
with which translocation occurs in cell cultures, even when the cells transfect poorly 

20 using conventional techniques, uptake of a VP22/GFP fusion proteins by Jurkat 
T-cells and PCI 2 cells, which are known to be refi^tory to standard transfection 
protocols, was also performed. These experiments show that VP22 fusion proteins 
can be purified and then delivered to substantially every cell in a cultured mammalian 
cell population, completely eliminating the need for transfection, even when the cell 

25 line is known to be refiractory to standard transfection protocols. 

pCRT7AnP22-l is derived firom the pET9b vector backbone (Novagen, 
Madison, WI). In preparation of pCRT7AnP22-l, the sequence encodmg the C- 
terminal region of VP22 sufficient for translocation activity (amino acids 159 -301), a 
firagment containing a multiple cloning site and myc and His tags fi"om the vector 
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pVP22/myc-His were inserted into the pET9b vector backbone. The multiple cloning 
site of pCRT7A/^P22-l was derived from that of pVP22/myc-His. The 
pCRT7AT22-l vector was prepared for coupling to Vaccinia Topoisomerase I in 
exactly the same way as in preparation of the pVP22/myc-His-TOPO® plasmid, as 
5 described above. Therefore, in this vector, the sequence encoding tiie ORF of a fusion 
partner can be either inserted into one of the multiple cloning sites or cloned as a PGR 
product into the topoisomerase cloning site in a way similar to that used with 
pVP22/myc-His or pVP22/myc-His-TOPO® plasmid. 

In a typical experiment a VP22 fusion protein was expressed as follows. Ten 
10 ng pCRT7Aa'22-l DNA was transformed into 50 BL21 (DE3)plysS ceils. The 
transformed cells were incubated at 37°C for 1 hour in 200 ^1 SOC medium, which 
was then diluted to 2 ml with Luria-Burtoni (LB) medium plus 50 |ig/ml kanamycin 
and allowed to grow overnight at ZTC. The 2 ml culture was used to inoculate 50 ml 
LB medium containing 50 jig/ml kanamycin. Cells were allowed to grow until an 
1 5 optical density of 0.5 - 0.6 was attained and then allowed to continue growth at either 
37'*C, or shifted to room temperature (approximately 25°C) for 30 min. One ml of 
culture was removed and allowed to continue growing. Isopropyl-P-D- 
thiogalactopyranoside (IPTG) was added to the remaining culture to a final 
concentration of 1 mM. Cells were allowed to grow for an additional 4 hours and 
20 then gel samples were prepared from induced (plus IPTG) and non-induced cultures. 
200 \x\ of each culture were removed, cells were recovered by centrifiigation, and the 
pellets raised in 50 |iil 1 X SDS/PAGE sample buffer. Alternatively, cells were 
recovered from the remainder of the culture by centrifugation and the cell pellets 
stored at •SO^'C- 

25 The VP22 fusion protein was purified as follows: The cell pellet was thawed 

on ice and resuspended in 4 ml ice cold lysis buffer (50 mM Sodium Phosphate pH 
8.0, 300 mM NaCl, 5 mM imidazole). The following were added to the lysis buffer 
immediately before cell lysis: P-mercaptoethanoI to 5 mM, a-toluenesulfonyl fluoride 
(PMSF) to 0.5 mM, leupeptin and pepstatin to 1 p-g/ml each, and lysozyme to 

30 1 mg/ml. The lysate was incubated on ice for 20 to 30 min and then sonicated for 
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3x10 sec while on ice. DNase and RNase were added to final concentrations of 
10 jxg/inl each. The lysate was left on ice for an additional 20 min, then drawn 
through a 21 gauge needle three times, and centrifdged at 20000 g for 15 min. 
Following centrifugation, a gel sample was prepared from the soluble supernatant 
5 The supernatant was applied to a column containing 1 ml Probond resin equilibrated 
with lysis buffer (Probond beads interact with proteins tagged with 6 histidine arrays). 
The resin and supernatant were mixed in the colunm on ice for one to two hours. The 
column was then clamped vertically and the resin was allowed to settle. 

A sample of the supernatant was removed to test for the presence of unbound 
10 protein by SDS/PAGE. The resin was washed by allowing 10 ml lysis buffer (50 mM 
Sodium Phosphate pH 8.0, 300 mM NaCl, 5 mM imidazole) to pass through the 
column. The lysis buffer was collected and a gel sample was removed. The column 
was then washed with 20 ml wash buffer (50 mM Sodium Phosphate pH 8,0, 300 mM 
NaCl, 40 mM imidazole, 10 % glycerol) and another gel sample was prepared in the 
1 5 same way. Protein was eluted by addition of buffers having mcreasing concentrations 
of imidazole (wash buffer with either 100 mM, 200 mM or 500 mM imidazole). 3 ml 
of each buffer was ^plied, and 3 ml of each of the 100 mM and 200 mM imidazole 
elutions were collected. The 500 mM imidazole elution was collected as 0.5 ml 
fractions. A gel sample was also prepared from 10 fig of the resin after elution to 
20 determine if the protein remained bound. All samples were examined on 4-20% 

SDS/PAGE gels (Novex) followed by Coomassie Staining or Western blot using an 
anti-myc-HRP conjugated antibody (Invitrogen) at 1:2000 dilution. Purified proteins 
were stored at 4°C for immediate use, or frozen at -SO^C for storage. 

Uptake of VP22 fiision proteins was detected by immunofluorescence. Cells were 
25 grown to approximately 50% confluence in 35 mm wells. The medium was then 

removed and replaced with 1 ml of serum free medium. Approximately 10 ^g of the 
purified VP22 fiision protein, eluted in wash buffer containing 500 mM imidazole, 
was added directly to the SOTun-free medium. Cells were incubated at 37^C for 20 
min and then wadied with 3 x 2 ml PBS. Cells were then fixed and permeabiized in 
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methanol for 5 min and prepared for immunofluorscence as described previously (see 
Invitrogen pVP22/myc-His Vector, cat no. V484-1). 

Alternatively, uptake of VP22 fiision proteins was detected by Western blot. 
This technique was used to detect accumulation of VP22 fusion protein in the nuclei 
5 of PC12 cells and Jurkat T-cells. Suspensions of approximately 5 x 10^ cells of e^h 
type were transferred to 15 ml Falcon tubes. Cells were recovered by centrifiigation 
at 500 g for 5 min and then resuspended in 10 ml PBS, Cells were washed again in 
the same way, then resuspended in 1 ml serum-free medium containing approximately 
10 |Ltg of the VP22/GFP fusion protein and incubated at Sl'^C for 15 min. Following 

10 the incubation, cells were washed twice by centrifiigation and resuspended in 10 ml 
PBS as before described. Cells were recovered by centrifiigation again, raised in 
100 \xl ice cold lysis buffer (10 mM HEPES-KOH, pH 7.9, 1.5 mM MgCk, 10 mM, 
KCl, 0.5 mM dithio threitol (DTT), 1% Triton X-100), and incubated on ice for 10 
min. The lysate was centrifuged at 10000 g for 10 min. The supernatant, contaming 

15 soluble cytoplasmic proteins, was removed, and 4 X protein sample buffer was added 
to the supematant. The pellet, containing cell nuclei, was resuspended in 100 nl 1 X 
protein sample buffer. Samples were run on 4-20% SDS/PAGE gels (Novex) and 
transferred to nitrocellulose membrane. Western blote were probed with an anti-myc- 
HRP antibody conjugate (Invitrogen). 

20 FXAMPLE 5 

Activity of a VP22 fasion protein in recipient cells: functional testing of a 
VP22/Rev fusion protein 

The HIV Rev protein is encoded by HIV genomic RNA and is responsible for 
regulation of RNA spUcing. The Rev protein can bind to transcripts that contain a 
25 Rev Response Element (RRE), allowing export of the transcript firom the nucleus and 
subsequent translation (reviewed in V. W. Pollard et al., Ann. Rev, MicobioL 52:491- 
532, 1998). In the absence of Rev, transcripts that contain RRE will complex with the 
mv spliceosome, but are not spliced. Instead, they remain in the nucleus and are 
degraded. 
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In the following experiment, the binding of Rev to the RRE in a transcript was 
used to activate expression of a reporter gene. A reporter plasmid (pCAT/RRE) was 
prepared that contains a CMV promoter and a CAT reporter gene separated by a 
splice donor site. The RRE was located on the 3' side of the CAT gene site. 
5 Therefore, expression of CAT in response to Rev can be used to demonstrate the 
activity of Rev in the VP22/Rev fusion protein. 

CHO cells were transfected with pCAT/RRE and then treated with either 
VP22/Rev fusion protein or VP22/myc-His control fusion protein. Expression of the 
CAT reporter gene was examined by Western blotting of protein samples prepared 

10 from treated cells, using an antibody against the CAT protein. A sample was also 
prepared from cells transfected with a CMV-CAT positive control plasmid that does 
not contain the RRE- Expression of the positive control could be detected. When 
cells were transfected with pCAT/RRE and then treated with VP22/myc-His control 
fusion protein, no expression of CAT could be detected. However, when cells 

1 5 transfected with pCAT/RRE were treated with VP22/Rev fusion protein, expression 
of CAT could be detected. When a five-fold larger amount of VP22/Rev protein was 
added, an apparent increase in the level of CAT protein was detected. These results 
show that VP22 can deliver functional Rev protein to the nucleus and lead to 
expression of a reporter gene. 

20 In HIV infected cells, the Rev protein can shuttle between the cytoplasm and 

nucleus. The distribution of Rev between these intracellular compartments is 
dependent on a nuclear export signal present in the protein. To determine whether the 
nuclear export signal functioned in the invention VP22/Rev fusion protein, the 
distribution of VP22/Rev protein was examined by immunofluorescence using an 

25 antibody against the myc epitope tag, as described above. By this procedure, 

VP22/Rev fusion protein was detected in the cytoplasm and nuclei of cells, showing 
that fusion of Rev to VP22 appears not to interfere with the ability of Rev to be 
distributed to either the cytoplasm or the nucleus. 
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FXAMPLE 6 

Delivery of one or more molecules into cells in order to modify cellular processes 

The following experiment demonstrates how a cellular process may be 
modified in a cell by delivery to the cell of a VP22 fusion protein lhat contains as the 

5 fusion partner the small GTPase, rhoA, which is involved in the polymerization of 
actin microfilaments in mammalian cells. Previous studies have shown that when 
Swiss 3T3 cells are starved of serum for 16 hr, actin microfilaments involved in 
mamtaining the sh^e of cells depolymerize, leaving soluble actin monomers. 
Addition of serum causes rapid repolymerization of actin, restoring the 

1 0 microfilaments. This effect has been produced by microinjection of cells with 

activated rho A protein that has been expressed and purified from E. coh (A. Hall, 
Science 202:509-514, 1998). 

To test whether a VP22-rhoA fusion protein could generate a similar effect, a 
VP22-rhoA fusion protein was expressed and purified fi-om E. coh using 

15 pCRT7AT*22-l-TOPO® plasmid. Swiss 3T3 cells were treated with the purified 

protein as follows: 3T3 cells were grown to approximately 50% confluence in 35 mm 
wells. Then the medium was removed and replaced with 1 ml of serum firee medium. 
Cells were incubated for an additional 20 hr at 37°C. Approximately 1 ^g of either 
purified VP22/rhoA or VP22/myc-ffis fusion protein was then applied to the cells. 

20 Twenty minutes later, cells were washed with 3 x 2 ml PBS, then fixed for 5 mm in 
4 % formaldehyde (firom Invitrogen p-galactosidase Staining Kit). Cells were washed 
again with 3 x 2 ml phosphate buffered saline (PBS), permeablized for 5 min with 
0.1% Tween-20® detergent in PBS, then washed again with 2 x 2 ml PBS. Cells 
were blocked for 30 min with 10% fetal bovine serum (FBS) ua PBS before 

25 incubation with 0. 1 ^g/ml final concentration of FITC conjugated phalloidin (Sigma 
P-5282) in PBS/10% FBS for 30 min. 

Phalloidin binds to polymerized actin more strongly than to depolymerized 
actm, thus allowing for visualization of repolymerized microfilaments. Cells were 
washed again with 2 x 2 ml PBS prior to observation with an Olympus fluorescence 
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microscope and FITC filter. The purified fusion protein was applied to serum-starved 
3T3 cells. In cells that had been serum-starved and then treated with VP22/myc-ffis 
control fusion protein, no actin microfilaments could be detected and the cells 
appeared similar to serum-starved cells that had not been treated with either fixsion 

5 protein. By contrast, in cells that had been treated wifh. VP22-rhoA fusion protein for 
20 minutes, actin microfilaments could be clearly detected by binding of phalloidin. 
The distribution of actin microfilamaits in cells that had been treated with VP22-rhoA 
fusion protein appeared similar to that seen in cells that had neither been treated with 
a fiision protein nor serum-starved. These results indicate that VP22 can be used to 

1 0 deliver a functional rho A fusion protein to cells. 

The wild type rho A protein appears to stimulate polymerization of actin 
microfilaments from the cell membrane, but VP22 protein is normally transported to 
the cell nucleus. Since VP22/rhoA could stimulate the polymerization of actin 
microfilaments in a similar way, the distribution of VP22/rhoA protein was examined 
1 5 by immunofluorescence using an antibody against tiie myc epitope tag of the protem 
(Invitrogen). Most of the VP22/ilioA fusion protein could be detected in the 
cytoplasm of recipient cells and very little protein appeared to reach the nuclei. These 
stodies show that VP22/rhoA protein may be retained at the sites of riioA activity and 
not completely translocated to the nucleus. 

20 F.XAMPLE7 

Delivery of a VP22 fusion protein to a specific ceUular compartment by 
modification of VP22. 

The following expaiment demonstrates use of VP22 fiision protein to regulate 
distribution of the fusion partner within a specific cellular compartment. The HIV 
25 Rev protein (C. M. Troy et al., Neuroscience 1^:253-6 1 , 1 996) contains a leucine rich 
sequence that is sufficient to direct heterologous sequences out of tiie nucleus and into 
the cytoplasm. Furthermore, it has been shown that fiision of the Nuclear Export 
Signal (NES) to a heterologous protein that mcludes the canonical SV40 larger T 
antigen Nuclear Localization Signal results in distribution of tiie protein between the 
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cytoplasmic and nuclear compartments (W. Wen et al. Cell £2:463-473, 1995), 
Similarly, the Rev protein contains sequences for both nuclear import and export and 
is found in both the c3^oplasmic and nuclear compartments (U. Fischer et al.. Cell 
S2:475^83, 1995). 

5 To test the ability of a translocating protein to deliver a fusion partner to a cell 

location other than the cell nucleus, in the present experiment, a fusion protein that 
consists of VP22/myc-His with the eleven amino-acid Rev NES inserted between the 
VP22 ORF and the myc epitope tag was expressed in E. coU, purified as described 
above, and applied to cells in culture. 

1 0 Distribution of the fusion protein among the cellular compartments in the cells 

in culture was examined by immunofluorescence as described above. The distribution 
of the fusion protein was verified by western blot analysis of treated cells, as follows: 
A suspension of 5 x 10^ cells was transferred to 15 ml Falcon tubes. Cells were 
recovered by centrifiigation at 500 g for 5 min and flien resuspended in 10 ml PBS. 

1 5 Cells were washed again in the same way, then resuspended in 1 ml serum-free 

medium containing approximately 10 ^g VP22/GFP fusion protein, and incubated at 
Sl'^C for 15 min* Following the incubation, cells were washed twice by 
centrifiigation and resuspension in 10 ml PBS as before. Cells were recovered by 
ceQtrifiigation again and raised in 100 jil ice cold lysis buffer (10 mM HEPES-KOH, 

20 pH 7.9, 1.5 mM MgCli, 10 mM, KCl, 0,5 mM DTT, 1% Triton X-100® detergent) 
and incubated on ice for 10 min. The lysate was centrifuged at 10000 g for 10 min. 
The supernatant, containing soluble C3/toplasmic proteins, was removed and 
supplemented with 4 X protein sample buffer. The pellet, containing cell nuclei, was 
resuspended in 100 ^il 1 X protem sample buffer. Samples were run on 4-20% 

25 SDS/PAGE gels (Novex) and transferred to nitrocellulose membrane. Western blots 
were probed with an anti-myc-HRP antibody conjugate (Mvitrogen). These tests 
show that Rev NES adapted-VP22 -containing fusion protein can distribute into the 
cytoplasm and nuclei of treated cells. 
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EXAMP LES 

Use of a VP22 fusion protein as a component of an inducible gene expression 
system. 

To test the theory that a translocating protein can be used in an inducible gene 
5 expression system with great specificity, a T7 RNAPA/P22 fusion protein was 

expressed and purified fi-om E. coli using a protocol similar that described above for 
other VP22 fiision proteins. RNA polymerase activity was examined in an in vitro 
transcription assay. All reagents were fi-om an in vitro transcription kit (Ambion, 
Austin, TX), and were used according to the manufacturer's instructions. The amount 
10 of RNA produced by the presence of the T7 RNAPAT22 fiision protein was found to 
be similar to that of the T7 KNAP included in the kit. 

A reporter construct that contains a luciferase gene driven by a T7 promoter 
was also constructed. This construct was transfected into COS cells and 24 hours 
later purified T7 RNAPAT22 fiision protein was applied to the cells. After an 

15 additional 24 hours, cell lysates were prepared and examined for luciferase en2yme 
activity using a luciferase assay kit (Promega) according to the manufacturer's 
instructions. Addition of T7 RNAPAT>22 fiision protein to cells tnmsfected with the 
reporter gene resulted in five- to ten-fold increase above background in the level of 
luciferase expression, indicating that this system fimctions to control the expression of 

20 heterologous genes in eukaryotic cells. 

EXAMPLE 9 

Covalent and non-covalent coupling to translocating proteins. 

Peptide or oligonucleotide molecules may be covalently conjugated to 
translocating proteins using Linx® chemical affinity system (Invitrogen) which uses 
25 low molecular weight chemical affinity ligands salicylhydroxamic acid (SHA) and 
phenylboronic acid (PBA)* In this system, the low molecular weight chemical affinity 
hgands are used to form a bifunctional linker that attaches the translocating protein to 
a polynucleotide by means of a reversible pH-sensitive covalent bond. 
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Nucleic acid molecules containing PBA can be synthesized using PB A-NTPs 
(ProLinx, Seattle, WA) or, if double stranded, labeled with PBA-ATP using the 
enzyme terminal transferase. SHA-NHS ester can be used to attach SHA to lysine 
residues present in translocating proteins (Figures 3 A-C). The PBA-adapted molecule 
5 and the SHA-adapted translocating protein are then covalently linked and 

administered to cells. A fiill description of the procedures and conditions used to link 
proteins using this system is publicly available (Linx™ Rapid Protein Conjugation 
Kit, Catalog Nos K8050-01 to K8060-01, Invitrogen, San Diego, CA). 

E X AMPLE 1 0 

1 0 Assay for uptake of translocating protein: oligonucleotide conjugates 

A translocating protein and oligonucleotides of varying lengths can be 
conjugated and added exogenously to mammalian tissue culture cells. Single stranded 
DNA (ssDNA) of varying lengths containing PBA-ATP can be synthesized using 
PGR. A biotinylated 5' primer can be designed to allow purification of single- 

1 5 stranded molecules containing the PBA-ATP on a strepavidin coliunn. A series of 3* 
reverse primers can be generated to facilitate the synthesis of a number of ssDNA 
molecules between 20 and 2000 nucleotides in length. The purified ssDNA 
molecules containing PBA will then be mixed with the translocating protein-SHA. 
Different concentrations of the protein: oligonucleotide conjugate can then added to 

20 cells and allowed to incubate for up to 4 hours. After incubation, the cells can be 

washed, fixed, and then probed using a strepavidin-FITC conjugate. Any internalized 
oUgonucleotides will bind the strepavidin-FITC and be detected by fluorescence. It is 
expected that short oligonucleotides will be internalized very efficiently (Le. 
deliv^ed to 1 00% of the cells) and be concentrated within the nucleus* 

25 While the invention has been described in detail with reference to certain 

preferred embodiments thereof, it will be understood that modifications and variations 
are within the spirit and scope of that which is described and claimed. 
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WHAT IS CLAIMED IS: 

1 . A method for modiilating a cellular process, said method comprising 
contacting a cell in culture under suitable conditions with a cell process-modifying 
molecule attached to a translocating polypeptide, whereby the cell process-modifying 
molecule is translocated into the cell in culture and interacts specifically therein with 
a target site responsive to the cell process-modifying molecule, tliereby modulating a 
cellular process in the cell in culture. 

2. A method for tratisfecting a cell in culture with a target gene, said method 
comprising contacting the cell in culture under suitable conditions with a 
polynucleotide comprising the target gene attached to a translocating polypeptide, 
whereby the cell in culture is transfected by the target gene. 

3 . The method according to claim 2 wherein the translocating polypeptide is a 
VP22 polypeptide, Antp, or Protein H. 

4. The method according to claim 2 wherein the translocating polypeptide is a 
VP22 polypeptide and the polynucleotide is translocated into the nucleus of the cell in 
culture. 

5. The method according to claim 2 wherein the polynucleotide is linear or 
circular DNA containing a cloned open reading frame that encode the target gene, 

6. The method according to claim 5 wherein the polynucleotide is a supercoiled 
plasmid. 

7. The method according to claim 2 wherein the translocating polypeptide is 
attached to a DNA binding protein and the DNA binding protein links the 
translocating polypeptide to the polynucleotide. 
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8. The method according to claim 7 wherein the DNA binding protein is histone 
1 protein, high mobiUty group 17 protein (HMG17), a polylysine sequence, or an 
oligopeptide having at least three LARL repeats therein, 

9. The method according to claim 2 wherein the translocating polypeptide is 
attached to a nuclear export signal and the polynucleotide is transfected into the 
cytoplasm as well as the nucleus of the cell in culture. 

1 0. The method according to claim 9 wherein the nuclear export signal is derived 
from the HIV Rev protein or the heat stable inhibitor of cAPK. 

1 1 . The method according to claim 2 wherem the target gene is stably integrated 
into the genome of the cell in culture* 

12. A method for modulating expression of a target gene product in a cell in 
culture that contains a target gene imder control of one or more regulatory elements, 

said method comprising contacting the cell in culture under suitable conditions 
with one or more regulatory agents attached to a translocating polypeptide, whereby 
the one or more regulatory agents are translocated into the cell in culture and interact 
therein with the one or more regulatory elements, thereby modulating expression of 
the target gene product by the cell. 

13. The method according to claim 12 wherein the cell in culture is a mammalian, 
yeast, insect or plant cell. 

14. The method according to claim 12 wherein the translocating polypeptide has 
the properties of: 

resistance to proteolysis, 

receptor-independent penetration of cell membranes, and 
energy-free penetration of cell membranes. 
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15. The method according to claim 12 wherein the translocating polypeptide is a 
VP22 polypeptide, Antp, or Protein H. 

16* The method according to claim 12 wherein the translocating polypeptide is a 
VP22 polypeptide. 

17. The method according to claim 12 wherein the regulatory agent is a 
polynucleotide, a protein or polypeptide, or a small molecule. 

1 8. The method according to claim 12 wherein the cell in culture is transfected 
with a polynucleotide con^rising the target gene. 

19. The method according to claim 14 wherein the regulatory elem^t is a 
promoter and translocation of the regulatory agent transactivates expression of the 
target gene product by the promoter. 

20. The method according to claim 19 wherein the regulatory agent is specific for 
the promoter. 

2 1 . The method according to claim 20 wherein the regulatory agent is a 
polymerase specific for the promoter. 

22. The method according to claim 2 1 wherein the polymerase is T7 RNA 
polymerase and the promoter is a T7 promoter. 

23 . The method according to claim 12 wherein the regulatory agent is an HIV Rev 
protein and the regulatory element is the HIV Rev response element (RRE). 

24. The method according to claim 12 wherein the regulatory agent is a 
transcription factor specific for the regulatory element and translocation of the 
regulatory agent transactivates expression of the target gene product. 
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25. The method according to claim 12 wherein the regulatory agent and the 
translocating polypeptide are covalently attached, 

26. The method according to claim 12 wherein the regulatory agent and the 
translocating polypeptide are attached by a linker. 

27. The method according to claim 26 wherein the linker comprises one or more 
disulfide bonds, salicylhydroxamic acid (SHA), phenylboronic acid (PBA), a SHA- 
NKS ester, or a combination thereof. 

28. The method according to claim 12 wherein the translocating polypeptide and 
the regulatory agent are units of a fusion protein. 

29. The method according to claim 12 wherein the regulatory zgesat is a single 
chain antibody (sFv), 

30. The method according to claim 12 wherein the regulatory agent is a 
polynucleotide encoding a single chain antibody, 

3 1 . The metiiod according to claim 12 wherein the translocating polypeptide and 
the regulatory agent are covalently linked by a biotin-streptavidin complex or the E. 
Coli single stranded DNA binding protein. 

32. The method according to claim 12 wherein the cell line contains a single 
genomic recombination site aad a plasmid containing the target gene and a 
recombination site that pairs with the genomic recombination site, and wherein the 
one or more regulatory agents includes a recombinase specific for the paired 
recombination sites, and wherein translocation of the recombinase causes 
recombination between the paired recombination sit^ resulting in stable integration 
of the target gene into the genome of the cell at the genomic recombinase site. 
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33- The method according to claim 32 wherein the recombinase is Flp and the 
recombinase sites are frt recombination sites. 

34. The method according to claim 32 wherein the recombinase is Cre and the 
recombinase sites are lox recombination sites. 

35. The method according to claim 12 wherein the one or more regulatory 
elements includes a transcription-blocking sequence flanked by recombinase 
recombination sites and the regulatory agent is a recombinase specific for the 
recombination sites, wherein translocation of the recombinase causes recombination 
of the recombination sites, thereby modulating expression of the target gene product, 

36. The method according to claim 35 wherein the recombinase recombination 
sites are frt sites and the recombinase is Flp or the recombinase recombination sites 
are lox sites and the recombinase is Cre. 

37. The method according to claim 12 wherein the one or more regulatory agents 
include a single chain antibody specific for a component of the one or more regulatory 
elements, wh^ein translocation of the single chain antibody m to the cell and binding 
of the antibody to the component modulates expression of the target gene product. 

38. The method according to claim 12 wherein the target gene is a reporter gene. 

39. The method according to claim 12 wherein the target gene is contained within 
a polynucleotide that further encodes a protein tag. 

40. The method according to claim 12 wherein the target gene encodes a toxic 
protein, 

41 . The method according to claim 39 wherein the protein tag is a myc epitope, a 
fluorescent peptide, or a poly His tag, or a combination of any two or more thereof. 
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42. The method according to claim 12 wherein the contacting comprises mixing 
the mammalian or insect cell with an additional cell transfected with a polynucleotide 
that encodes the regulatory agent and the translocating polypeptide and the additional 
cell expresses the nucleotide to obtain the regulatory agent attached to the 
translocating polypeptide. 

43 . The method according to claim 42 wherein the additional cell is prokaryotic or 
eukaryotic. 

44. The method according to claim 12 wherein the contacting involves incubathig 
the cell line with a soluble protein lysate prepared jfrom an additional transfected cell 
that expresses one or more polynucleotides encoding the regulatory agent and the 
translocating polypeptide. 

45 . The method according to claim 44 wherein the regulatory agent and the 
translocating polypeptide are expressed by the additional cell as a fusion protein. 

46. The method according to claim 12 wherein the cell is refractory to other 
transfection techniques. 

47. The method according to claim 12 wherein the cell is a member of a cell 
population and expression of the target gene is induced in substantially all of the cell 
population. 

48. A vector comprising a polynucleotide encoding a cell process-modifying 
molecule attached to a translocating polypeptide. 

49. The vector according to claim 48 wherein the vector is has a nucleotide 
sequence according to SEQ ID N0:1. 

50* The vector according to claim 48 wherein the vector is has a nucleotide 
sequence according to SEQ ID NO:2 
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GACGGATCG<K3AGATCTCCCGATCCCCTATGGTCGACTCrCAGTACAATCTGCTCTGATGCC^ 

AAGCCAGTATCTGCTCCCTGCITGTGTOTTGGAGGTCGCTGAGT^ 

ACAAGGCAAGGCTTGACCGACAATTGCyVTGAAGAATCTQC^ 

ATGTAOSGGCCAGATATACGCGTTGACATTGATTATTGACTAGTTAro 

ATTAGTTCATAGCCCATATATGGAGTTCCGCOTTACATAACrTACGGTAAAT^ 

CCCAA€X3ACCCCCGCCCATTOACGT<3ATAATGACGTATGTTCC 

ATTGACGTCAATGGGTGGACTATTrACGGTAAACTGCCCACTI^ 

AAGTACGCCCCCTATTGACGTCAATCACGGTAAATOGCCCGCCTGGCATTATGCCCAG^^ 

TGGGACTTTCCTACrrnSGCAGTACOTCTACGTATTAGTCAT^ 

AGTACATCAATGGGCGTGGATAGOSGTTTGACTCSVCGGGGAT^ 

TGGGAGTTTGTTTTGGCaCCAAAATa^ACGGGACTTTCC^^ 

CAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCrrC^^ 

CTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAGTC^ 

ATGACCTCTCGCCGCTCCGTGAAGTCGGGTCCGCGGGAGGTTCCGCGCGATGAGTACGAGGAT^ 

AmCCCCGTCTTCAGGTATGGCXSAGTCCCGATAGTCCGCCTGACACCTCCCGCCGT^ 

ACGCTCGCGCCAGAGGGGCGAGGTCCGTTTCGTCCyVGTACGACGAGTCGGArrATGCCCTCTACGGG^ 

TCGTCrrrCCGAAGACGACGAACACCCGGAGGTCCCCCGGACGCGGCGTCCCGTTTCCGGGGC^ 

CCGGCCOSGQGCCTGraCGGGCGCCTCKiXSCCACCCGCTGGGTCCGGAGGGGCCOT 

CGCrocCCGGGCCCCCaSAACCCy^GCGGGTGGCGTCTAAGGCCCCCGCGGCCCCGGCGGCGGAGACCA 

CGCGGCAGGAAATCGGCCCAGCCAGAATCC6CCXSCACTCCCAGAC?GCCCCCGCG^ 

GATCCAAGACATO:::Ga3CaiGGGGCTGGCCAGAAAGCTGC^ 

GCCATGGACCCCCCGGGTGGC<X;GCTTTAACAAGCX3CGTCTTCTGOT 

ATGC7mK:CCGGATGGCGGCTGTCC3lGCTCTX^ 

AA<™xrTTG6aiTCACXACC3.TCCGCGTG^ 

GTTGGTCAATCaUJAOSTGGTGaiGGACGTCGACGaSGCa 

CGCCCCACCGAGCX3ACCTCGAGCCCCAGCCC6CrrCCX3CrrTCr^^ 

AGCTOGGATCavCTAGTCCAGTGTGGTGGAATTCTGCAGA 

AGAGGGCCCGCGGTTCXSAACAAAAACTCATCTCAGAAGAC^ 

GCCCCTOTOCCGTGCCriTCCTTOACCCTOGAAGGl^ 
AATTGGATaSCATTGTCTGAGTAGGTCTCSV^^ 

GAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTA 

CCAGCTGGGGCTCTAGGGGGTATCCCXIACGCGCCCTGTAGCGGCC^ 

TAOGaSCAGCGTGACCGCTACAgnXK:CAGC!GCCX?^^ 

CTOSCCSlCGTTCXSCCGGCTTTCCCCXlTCaAGCTCT 

CTTTAOGGCACCTCGACCCau^AAAACTTCATTAGGG 

GACGGTTTTTCGCCCTTTGACGTTGGAGTCCAaSTT^^ 

AO^CTay^CCCTATCTCGGTCTATTCTTT^ 

A2\AATGAGCTGATirCAACAAAAATTTAACGC3GAATTAAT^ 

AAGTCCCCAGGCTCCCCAGGCaGGCAaAAGTATGaUUlGCATGC^ 

GGAAAGTCfCCCMGCTCCCXaGCAGGCAGAAGTATGCAAAGCATC 

TCCaKrCCCTAACTCCGCCCATCCCGCCCCTAAerCCGCCCaWSOT 

ACTAATTTTTTTTATTTAOXSCAGAGGCCGAGGC^^ 

GGOTTTTTOSGAGGCCTAGGCOTTTC 

AAGAGACAGGATQAGCSATCGTTTCGCSVTGATTGAACAA 

GGTGGAGAGGCIMTCGGCTATGACTGGGCACAAC^^ 

CTGlXaWKX3CAGGGGCGCCXX3GTTCTTTTTGTCAA^ 

ACGAGGCAGCGOKSCTATCGTGGCTGGCCACGAC^^ 

TGAAGCGGGAAGGGACTOGCTGCTATTCGGCGAAGTGCCGGG^ 

CCTGCOGAiGAAAGTATCCaiTCATGGCrGATGCAATX^^ 

CATTCGACCACCAAGCGAAACATCGOVTOTAGOSa^^ 

GGATGATCTGGAOSAAGAGaSiTaVGGGGCTCGaSCCAGCCC^ 

CCaSACGGOGAGGATCTCGTCGTGACCCATGGCGATGCCrrGC^ 

GCTTTTCTGGATTCATCGACTCTGGCCGGCTGGGTCT^ 

CCGTGATATTGCTGAAGAGCTl^CGGCGAATGGGCTGACCGCTTCCTCG 

CCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGAOSAGTTCTTerGAGCGG^ 

GAAATGACCmCCAAGOSACGCCCAACCnXSCCATCACGAGATTTCGATTCCACC^ 

GGTTGGGCTTCGGAATCGTTTTCCGGGACGCaSGClXKSA 

GTTCTTCGCCCAOTCCAACTTGTTTATTOaGCOTATAAT^ 

TTCACAAATAAAGCATTTTTTTCACTGCATTCTAGl^ 

ATGTCKSTATACCGTCGACCTCTAGCrrAGAGCTTCGCGTAATCATGGTCAT^ 

TTGTTATCCGCTCACAATTCCmCACaiACATACGAGC^ 

TGAGTGAGCTAACTCACATTAATTGCGTTGCGCTaiCra 

AGCTGCATTAATGAATOXKICAACGCGCOGGGAGAGGCGGTTTGCGTA^^ 

GCTCACTGACTOSCTGCGCrrcGGTCGTTCGGCrGOGGC^ 

CGGTTATCCACAGAATCaGGGGATAACGCAGGAAAGAACATGTGAGCaAAATO 

ACCGTAAAAAGGCCGCGTTGCTCGCGTTTTTCCATAGGCTCCGCCCCCCl^ 
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ACGCTCftAGTCAGAGGTGGCGAAACXXX3ACAGGACTATAAAGATACCAGGC»TTTCCC^ 

CTCGTGCGCTCTCCTGTTCCGACCCrKK:C6CTmCCGGATACCT 

TGGCGC:OTTCTCAATGCTC».CGCTGTAGGTATCimGCT 

TGTGCACGAACCCCCCGTTCAGCCOGACOGCKKXSCCTTATCCGGTAAm 

GTAAGACACGACTTATaSCaVCrGGCAGCAGCCAC^^ 

GTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGA 

TCTGCTGAAGCCAGTTACCrrTCGGAAAAAGAG'^^ 

AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTAC^ 

TCTTTTCTACGGGGTCTGAaSCTCAGTGGAACGAAAACT 

AAAAAGGATCTTaVCCTAGATCCTTTTAAATTAAAAATGAAGTT^ 

TAAACTTGGTCTGACAGTTACXAA!rGCrPTAATC3lGTCAG^ 

CATCCATAGTTGCCTGACTCCCCGTOSTOTAGATAACTACGATACGGGAGGGC^ 

TGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATm 

AGGGCCG3«3CGaiGAAGTGGTCCrrGCAACTTTATCCGCCTC 

CTAGAGTAAGTAGTTaSCaGTTAATAGTTTGCGCAACGTlXSTT^C^^ 

AOXrrOGTCGTTTGGTATGGCTTCSlTT^^ 

ATGTTGTGCAAAAAAGOGGTTAGCTCCTTC^^ 

TAT»CTCATXKn*rATQGCaGCACTXK3lTAATT^^ 

GACTGGTGAGTACTCAACGAAGTOCrTCTGAGAATAGTGTATGC^ 

TCAATAaKK3ATAATACCXK:X30:ACATAGCAGAACT^ 

GGCGAAAACTCTCS^AGGATCTTACCGCTGTT^ 

ATCTTCAGCATCTTTTACTTTCACCAGC^^ 

AAGGGAAiaAGGGCSACACGGAAATGTTGAATACTCAaC^^ 

ATCAGGGTTATTGTCrCATGAGCSGATAaiTATTTGAATGTAT^ 

GCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
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GACGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAGTACmTCTGCTCT^ 

AAGCCAGTATCTGCTCCCTGCETGTGTGTTGGAGGTCGCTGAGTA^ 

ACAAGGCAAGGCTTGACCGACSUVTTGCATGAAGAATCrGCTT^^ 

ATGTACGGGCCAGATATACGCGTTGACATTCATTATTGACTAGTTATTAATAGT^ 

ATTAGTTCATAGCCOiTATATGGAGTTCCGCXSTTACATAACTTACG^ 

CCCAACGACCCCOGCCCATTGACGTCaUVTAATGACGTATGTTCCCATAGT^ 

ATTGACGTOU^TGGGTGGACTATTTACGGTAAACriXSCCCACTT^ 

AAGTACGCCCCCTATTGAOSTCAATGACGQTAAATGGCCCXSCCTGGCAT^^ 

TGGGACTTTCCTACTTGGCAGTACATCrrACG^^ 

AGTACATCAATGGGOGTGGATAGCGGTTTOACTCACGGGGATT^^ 

TGGGAGTTTGTTTTGGCACCAAAATCSVACGGGACTTTC 

a^TGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTC^^ 

Cn^CTTACTGGCTTATCGAAATTAATACGACTCACTAT^ 

ATGACCTCrrCGCCGCrCCGTCAAGTOGGGTCaSOGGGAGGTTCCGro 

ACACCCCGTCTTCAGGTATGGCGAGTCCCGATAGTCCGCCTOAaiCC^ 

ACGCTCGCGCCAGAGGGGCGAGGTCCGTTTCGTCCAGTACGACGAGTaSGATTATC 

TCGTCTTCCGAAGACGACGAACy^CCCGGAGGTCCCCOSGACXSCGGCGTCCCGTTTCC^^ 

CCGGCCCGGGGCCrGCGOKMaSCCTCajCCACCCGCTGGGTCaS^ 

CGCCCCCCGGGCCCCCCGAACXX:AGOGGGTGGCGACTAAGGrcCCCGCG^ 

CGOGGCAGGAAATCGGCCCAGCmGAATCCGCCGCACTCCOlGACGCCCCroaSTa^ 

GATCCAAGACACCCGCGCAGGGGCrGGCCAGAAAGCTGCACr^ 

GCCATGGACCCCCCGGGTGGCCGGCTTTAAa\AGC3Ga5a^^ 

ATGCATGCCOSGATGGOGGCGGTCCafiCTCK^^ 

AACTCCTTGGCATCACCACaTCCGOGTG^ 

GTTOGTGAATCCAGAOSTGGTGCAGGAOGTCGACGaSGCCA^ 

CGCCCCACCGAGOSACCTCGAGCCCCAGCCCGCrrCCGCTT^ 

AGCXCXKSATCa^CTAGTCCAGTGTGGTGGAATTGCCCTTAAGGGCA^^ 

GOSGCCGCTCXaGTCrrAGAGGGCCaSCXKmXXS^ 

TACCGGTCATCATCACCATCaCCATTGAGTTTAAACCroCT^ 



CCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGT^ 

GGAGGACAGO^AGGGGGAGGATTGGGAAGACAATAGCAGGCATC 

TCKSAGGCGGAAAGAACCaGCTGGGGCTCTAGGGGGrA^ 

CGGCXSGGTGTGGTGGTTACGCGCAGCGTGACrGCnACS^C^^ 

TTTCTTCCCTTCCITTCTCGCCACGTTC^ 

GGGTTCCGATTTAGTGCTTTACGGCACCTOGACCCCAAAAAAC^^ 

GGCCATCGCCCTGATAGAaSGTTTTTasreCTTT^ 

GTTCCAAACrrGGAACAACACTCaUVCCCT^ 

TCGGCCTATTCGTrAAAAAATGAGCTGATTTAAO^^ 

TGAGTTAGGGTGTGGAAAGTCCCXa^GGCTCCCCAGGCSWS^^ 

GTOVGOVACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCai^^ 

TAGTCAGCAACCATAGTCCCGCCCCTAACT^^ 

CTCCGCCCCATGGCTGACTAATTTTTTTTAlTTATGCAf^ 

CCAGAAGlAGTGAGGAGGCTTTTTTGGAGGCCrE^^ 

ATTTTCXKSATCreATCAAGAGACAGGATGA^ 

GTTCTCCGGCCGCTrGGGTGGAGAGGCTATTOSGCTA^ 

TGCCGCXSGTGTTCCGGCTGTCAGCGCAGGGGOGCC^^ 

CTCAATCAACTGCAGGACXSAGGCAGCGaSGCTAa^^ 

TGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCnX^ 

GTCATCTCACCTTGCTCCriX3CXX3AGAAAGTATCCATa 

GATCaSGCTACCTCCCOOTCXSMCACaUlGCGAAACATCGC^ 

CCGGTCTTGTaSATCAGGATGATCTGGAOGAAG 

ATCGTGGAAAATGGCCGCT l " irr CTGGATTCATCX3ACTC 

ACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGOKSCGAATCG^ 

TTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCC^^ 

GGACTCTGGGGTTCGOGAAATGACCGACCAAGCGACGCCXMCCTGCC^ 

GCCGCCTTCTATCAAAGGTTGGGCTTCGGAATCGTTT^ 

GGGATCTCATGCTGGAGTTCTTaSCCCACCCCAACTTGTTTAl^ 

CAATAGCATCACAAATTTCACSUUWTAAAGCaOTTTTT^ 

ATCAATGTATCTTATCATGTCIGTATACCGTaSACCTCT^ 

TGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTC^ 

AGCCTGGGGTGCCTAATGAGTGAGCTAACrCACATTAArr^ 

GGAAACCOTTOSTGCCAGCTGCATTAATGAATCGGCCa^ 

GCrCTTCaSCTTCCrraSCTCACrrGACTCGCTGCGCT 

CTCAAAGGCGGTAATAOKSTTATCCACAGAATCAGGGGATAAC^ 

CAGCaAAAGGCCAGGAACCGTAAAAAGGCCGarrTGCTGGOSTT^ 
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AGCATCaCAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCX^ 

TCCCCCTOGAAGCTCXrCTCGTCCGClCTCCTOTTCCGACCCTGCCGC^ 

CTCCCTTCGGGAAGOGTGGCGCTTTCTCaU^TGCrrCATO 

GCTCCAAGCTCGGCnX3TGt<K:ACGAACCCCCCXfTT» 

TCTTGAGTCCAACCCGGTAAGACACGACTTATCGCaiCri^^ 

GCGAGGTATGTAGGCGGTGCTACa^GAGTTCTTGAiWn^GGTGC^ 

TATTlX3GTATCTGCGCTCrr<5CTGAAGCCaW3T^ 

ACAAACaVCCGCTGGTAGCGGTGGTTTTTTTGTTTG^^ 

CAMAAGATCCTTTGATCTTTTCTACGGGGTCl^ 

TGGTCATGAGAITATCAAAAAGGATCTTCACCTAGATCCTTTTJ^ 

CTAAAGTATATATGAGTAAACTTGGTCraACAGTTACCAATGCTTAAT^ 

ATCTOTCTATTTaSTTCATCCATAGTTGCCTGACrrCCCXIGTaSTO 

TACCATCn^CCCCAGTGCTGCaATGATACCGCmGACCCACGCr^ 

AAACCawSCCAGCCGGAAGGGCCGAGMCAGAAGTGGTCCTGCAAC^ 

AATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCftGT^^ 

CAGG«TanX3GTCTCACGCTCGTCGTr^ 

AGTTACAl^TCCCXICATGTTGTGCAAAAAAGCGGTT^ 

AAGTTOGCCGCAGTOTTATCa.CPC3^TGGTTATGGCAGCa 

TAAGATGCTTTTCTGTGACTGGTGAGTACTCAACaU«3^^ 

TTGCTCrrrGCCCGGCGTCAATA02K3GATA^T^^ 

GGAAAACGTTCTTOGGGGCGRAAACTC TCAAG G ATC^ 

CIOSTGCACXCAACTGATCTTCAGCATCTTTTACTTTCACC^^ 

GOUUVATGCCGOJUU^GGGAATAAGGGOSACACX^GA 

TATTATTGAAGOlTTTATCAGGGrrATTGTCTCA^ 

AACaUUO'AGGGGTTCCGCGCACSlTTTCCCXIGAAAAGTGCCACC^^ 
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SEQUENCE LISTING 

<110> INVITROGEN CORPORATION 
DALBY, Brian 
BENNETT, Robert 

<12 0> DELIVERY OF FUNCTIONAL PROTEIN SEQUENCES 
BY TRANSLOCATING POLYPEPTIDES 



<130> INVIT1280-1 

<150> PCT/USOO/08571 
<151> 2000-03-31 

<150> 60/127,467 
<a51> 1999-03-31 

<160> 21 

<170> FastSEQ for Windows Version 4,0 

<210> 1 
<211> 6404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> vector pVP22/Myc-His 
<400> 1 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc ISO 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagt 900 

taagcttatt atgacctctc gccgctccgt gaagtcgggt ccgcgggagg ttccgcgcga 960 

tgagtacgag gatctgtact acaccccgtc ttcaggtatg gcgagtcccg atagtccgcc 1020 

tgacacctcc cgccgtggcg ccctacagac acgctcgcgc cagaggggcg aggtccgttt 1080 

cgtccagtac gacgagtcgg attatgccct ctacgggggc tcgtcttccg aagacgacga 1140 

acacccggag gtcccccgga cgcggcgtcc cgtttccggg gcggttttgt ccggcccggg 1200 

gcctgcgcgg gcgcctccgc cacccgctgg gtccggaggg gccggacgca cacccaccac 1260 

cgccccccgg gccccccgaa cccagcgggt ggcgtctaag gcccccgcgg ccccggcggc 132 0 

ggagaccacc cgcggcagga aatcggccca gccagaatcc gccgcactcc cagacgcccc 13 80 

cgcgtcgacg gcgccaaccc gatccaagac acccgcgcag gggctggcca gaaagctgca 144 0 

ctttagcacc gcccccccaa accccgacgc gccatggacc ccccgggtgg ccggctttaa 1500 

caagcgcgtc ttctgcgccg cggtcgggcg cctggcggcc atgcatgccc ggatggcggc 1560 

tgtccagctc tgggacatgt cgcgtccgcg cacagacgaa gacctcaacg aactccttgg 1620 

catcaccacc atccgcgtga cggtctgcga gggcaaaaac ctgcttcagc gcgccaacga 1680 

gttggtgaat ccagacgtgg tgcaggacgt cgacgcggcc acggcgactc gagggcgttc 1740 
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tgcggcgtcg cgccccaccg agcgacctcg agccccagcc cgctccgctt ctcgccccag 1800 

acggcccgtc gagggtaccg agctcggatc cactagtcca gtgtggtgga attctgcaga 1860 

tatccagcac agtggcggcc gctcgagtct agagggcccg cggttcgaac aaaaactcat 1920 

ctcagaagag gatctgaata tgcataccgg tcatcatcac catcaccatt gagtttaaac 1980 

ccgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt gcccctcccc 2040 

cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga 2100 

aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga 2160 

cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat 2220 

ggcttctgag gcggaaagaa ccagctgggg ctctaggggg tatccccacg cgccctgtag 2280 

cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 234 0 

cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 2400 

tccccgtcaa gctctaaatc ggggcatccc tttagggttc cgatttagtg ctttacggca 2460 

cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata 2520 

gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 258 0 

aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag ggattttggg 2640 

gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattaatt 2700 

ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagg caggcagaag 2760 

tatgcaaagc atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc 282 0 

agcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct 2880 

aactccgccc atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg 2 940 

actaattttt tttatttatg cagaggccga ggccgcctct gcctctgagc tattccagaa 3 000 

gtagtgagga ggcttttttg gaggcctagg cttttgcaaa aagctcccgg gagcttgtat 3060 

atccattttc ggatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga 312 0 

tggattgcac gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc 3180 

acaacagaca atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc 324 0 

ggttcttttt gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc 3300 

gcggctatcg tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac 33 60 

tgaagcggga agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc 342 0 

tcaccttgct cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac 3480 

gcttgatccg gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg 354 0 

tactcggatg gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct 3600 

cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt 3660 

cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg 3720 

attcatcgac tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac 3780 

ccgtgatatt gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg 384 0 

tatcgccgct cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg 3900 

agcgggactc tggggttcgc gaaatgaccg accaagcgac gcccaacctg ccatcacgag 3960 

atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 402 0 

ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact 4080 

tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata 4140 

aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc 4200 

atgtctgtat accgtcgacc tctagctaga gcttggcgta atcatggtca tagctgtttc 4260 

ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt 4320 

gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc 4380 

ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg 4440 

ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct 450 0 

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 4560 

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 4 62 0 

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 4 68 0 

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 474 0 

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 4800 

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcaatgctca cgctgtaggt 4860 

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 4 92 0 

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 4980 

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 504 0 

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 5100 

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 5160 

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 522 0 

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 5280 

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 5340 

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 5400 
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ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 
catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat 
ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag 
caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct 
ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 
tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg 
cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca 
aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt 
tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat 
gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 
cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa 
aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt 
tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt 
tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa 
gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt 
atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa 
taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtc 



5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6404 



<210> 2 
<211> 6420 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> vector pVP22/Myc-His-T0P0 
<400> 2 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 4 80 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 84 0 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagt 900 

taagcttatt atgacctctc gccgctccgt gaagtcgggt ccgcgggagg ttccgcgcga 960 

tgagtacgag gatctgtact acaccccgtc ttcaggtatg gcgagtcccg atagtccgcc 102 0 

tgacacctcc cgccgtggcg ccctacagac acgctcgcgc cagaggggcg aggtccgttt 1080 

cgtccagtac gacgagtcgg attatgccct ctacgggggc tcgtcttccg aagacgacga 1140 

acacccggag gtcccccgga cgcggcgtcc cgtttccggg gcggttttgt ccggcccggg 1200 

gcctgcgcgg gcgcctccgc cacccgctgg gtccggaggg gccggacgca cacccaccac 1260 

cgccccccgg gccccccgaa cccagcgggt ggcgactaag gcccccgcgg ccccggcggc 132 0 

ggagaccacc cgcggcagga aatcggccca gccagaatcc gccgcactcc cagacgcccc 13 8 0 

cgcgtcgacg gcgccaaccc gatccaagac acccgcgcag gggctggcca gaaagctgca 1440 

ctttagcacc gcccccccaa accccgacgc gccatggacc ccccgggtgg ccggctttaa 1500 

caagcgcgtc ttctgcgccg cggtcgggcg cctggcggcc atgcatgccc ggatggcggc 1560 

ggtccagctc tgggacatgt cgcgtccgcg cacagacgaa gacctcaacg aactccttgg 1620 

catcaccacc atccgcgtga cggtctgcga gggcaaaaac ctgcttcagc gcgccaacga 1680 

gttggtgaat ccagacgtgg tgcaggacgt cgacgcggcc acggcgactc gagggcgttc 1740 

tgcggcgtcg cgccccaccg agcgacctcg agccccagcc cgctccgctt ctcgccccag 1800 

acggcccgtc gagggtaccg agctcggatc cactagtcca gtgtggtgga attgccctta 1860 

agggcaattc tgcagatatc cagcacagtg gcggccgctc gagtctagag ggcccgcggt 192 0 

tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 1980 

accattgagt ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc cagccatctg 2040 
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ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt 2100 

cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg 2160 

gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 2220 

atgcggtggg ctctatggct tctgaggcgg aaagaaccag ctggggctct agggggtatc 2280 

cccacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 2 340 

ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 2400 

ccacgttcgc cggctttccc cgtcaagctc taaatcgggg catcccttta gggttccgat 2460 

ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 2520 

ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 2580 

gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 2640 

tataagggat tttggggatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 2700 

ttaacgcgaa ttaattctgt ggaatgtgtg tcagttaggg tgtggaaagt ccccaggctc 2760 

cccaggcagg cagaagtatg caaagcatgc atctcaatta gtcagcaacc aggtgtggaa 2820 

agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa 2 8 80 

ccatagtccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt 2940 

ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctctgcct 3000 

ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcaaaaagc 3060 

tcccgggagc ttgtatatcc attttcggat ctgatcaaga gacaggatga ggatcgtttc 312 0 

gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat 3180 

tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt 3240 

cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac 33 00 

tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg 3360 

tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa gtgccggggc 342 0 

aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg gctgatgcaa 3480 

tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa gcgaaacatc 354 0 

gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat gatctggacg 3600 

aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg cgcatgcccg 3660 

acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc atggtggaaa 3 720 

atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac cgctatcagg 378 0 

acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct 3 84 0 

tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc tatcgccttc 3900 

ttgacgagtt cttctgagcg ggactctggg gttcgcgaaa tgaccgacca agcgacgccc 3 960 

aacctgccat cacgagattt cgattccacc gccgccttct atgaaaggtt gggcttcgga 402 0 

atcgttttcc gggacgccgg ctggatgatc ctccagcgcg gggatctcat gctggagttc 4080 

ttcgcccacc ccaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc 4140 

acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc 4200 

atcaatgtat cttatcatgt ctgtataccg tcgacctcta gctagagctt ggcgtaatca 4260 

tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga 432 0 

gccggaagca taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt 4380 

gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga 4440 

atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc 4500 

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg 4560 

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 4620 

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 4680 

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga 474 0 

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc 4800 

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa 4860 

tgctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg 492 0 

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 4980 

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 5040 

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact 5100 

agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt 5160 

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag 522 0 

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg 5280 

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 534 0 

aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata 5400 

tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg 5460 

atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata 5520 

cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg 5580 

gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct 5640 

gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt 5700 
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tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc 5760 

tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 5820 

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 5880 

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 5940 

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 6000 

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca 6060 

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 6120 

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 6180 

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 6240 

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 6300 

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 6360 

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 6420 



<210> 3 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker 
<400> 3 

Leu Ala Arg Leu Leu Ala Arg Leu Leu Ala Arg Leu 
15 10 

<210> 4 
<211> 67 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Conserved sequence of steroid/ thyroid hormone 
receptor superfamily DNA-binding domain 

<221> VARIANT 
<222> (0) . . . (0) 

<223> Xaa is non-conserved amino acids within the 
DNA-binding domain 

<221> VARIANT 

<222> 7, 9, 11, 13, 22, 27, 58, 61, 66 

<223> amino acid residues that are almost universally 
conserved, but for which variations have been 
found in some identified hormone receptors 

<400> 4 



Cys 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Asp 


Xaa 


Ala 


Xaa 


Gly 


Xaa 


Tyr 


Xaa 


Xaa 


Xaa 


1 








5 










10 










15 




Xaa 


Cys 


Xaa 


Xaa 


Cys 


Lys 


Xaa 


Phe 


Phe 


Xaa 


Arg 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 








20 










25 










30 






Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Xaa 


Lys 






35 










40 










45 








Xaa 


Xaa 


Arg 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Cys 


Arg 


Xaa 


Xaa 


Lys 


Cys 


Xaa 


Xaa 




50 










55 










60 










Xaa 


Gly 


Met 





























65 

<210> 5 
<211> 34 
<212> DNA 



<213> Bacteriophage PI 



<400> 5 

ataacttcgt atagcataca ttatacgaag ttat 

<210> 6 
<211> 34 
<212> DNA 
<213> E. coli 

<400> 6 

ataacttcgt atagtataca ttatacgaag ttat 

<210> 7 
<211> 34 
<212> DNA 
<213> E. coli 

<400> 7 

acaacttcgt ataatgtatg ctatacgaag ttat 

<210> 8 
<211> 34 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 8 

gaagttccta ttctctagaa agtataggaa cttc 

<210> 9 
<211> 5 
<212> PRT 

.<213> Artificial Sequence 
<220> 

<223> peptide linker moiety 
<400> 9 

Gly Gly Gly Gly Ser 
1 5 

<210> 10 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety; sequence can be repeated 
indefinit number of times 

<400> 10 

Gly Gly Gly Gly Ser 
1 5 

<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
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<400> 11 

Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Ser 
15 10 

<210> 12 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 12 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys Gly 
15 10 

<210> 13 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 13 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Ser Gly Ser Thr 

15 10 15 

Lys Gly 



<210> 14 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Peptide linker moiety 
<400> 14 

Gly Ser Thr Ser Gly Ser Gly Lys Ser Ser Glu Gly Lys Gly 
15 10 

<210> 15 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 15 

Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr 

15 10 15 

Lys Gly 



<210> 16 

<211> 14 

<212> PRT 

<213> Artificial 



Sequence 
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<220> 

<223> Peptide linker moiety 
<400> 16 

Glu Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Glu Phe 
15 10 

<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 
<400> 17 

Ser Arg Ser Ser Gly 
1 5 

<210> 18 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Peptide linker moiety 
<400> 18 

Ser Gly Ser Ser Cys 
1 5 

<210> 19 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 23> Peptide linker moiety 

Mit^Gly^Arg Ser Gly Gly Gly Cys Ala Gly Asn Arg Val Gly Ser Ser 

1 5 10 15 

Leu Ser Cys Gly Gly Leu Asn Leu Gin Ala Met 
20 25 

<210> 20 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide linker moiety 

<221> VARIANT 
<222> (0) . . . (0) 

<223> Xaa is (GmS)n, where m is 

from 2 to 4 and n is from 1 to 11. 



<400> 20 

Ala Met Xaa Ala Met 
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<210> 21 
<2ai> 16 
<212> PRT 

<213> Drosophila acanthoptera 
<400> 21 

Arg Gin lie Lys lie Trp Phe Gin Asn Arg Arg Met Lys Trp Lys Lys 
15 10 15 
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